SlideShare a Scribd company logo
1 of 36
Download to read offline
© 2022. Access Innovations, Inc. All rights reserved.
Access Innovations, Inc.
Marjorie M.K. Hlava
mhlava@accessinn.com
Jay Ven Eman
j_ven_eman@accessinn.com
www.accessinn.com
www.dataharmony.com
+1.505.998.0800
Albuquerque, NM
Leveraging Your Content
Semantically
Where’s the one about…
Looney Tunes® Revisited
October 10, 2022
Wondering and wandering!
How do you find information…
when you don’t know what you want,
what it might be called,
where to look?
I used to wander the stacks…
Long Library, Trinity U., Dublin
Guinness Brewery
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Albuquerque, New Mexico Las Vegas, New Mexico
Las Vegas, Nevada
Background
What if there is sparse metadata unlike library
catalog cards?
Video?
- Notorious for no metadata
- Maybe a title
- Newspaper ‘slug’
Where’s the one about…
Daffy Duck and Donald Duck and pianos?
What was the one about…
Bugs Bunny - opera singer?
© Warner Bros.
Do you recall the one about…
a coyote and the what was it?
© Warner Bros.
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Our neighborhood Coyote
The Road Runner
Questions to ponder
❖ How do you find what you’re looking for?
❖ How do you know what you want?
❖ How do you know you found it?
❖ How do you know, if you’ve missed
something?
❖ How do you replicate wandering the
stacks in the Age of Google?
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
And now…
Case studies by
Marjorie Hlava
Case Study
❖ Access Innovations, Inc.
❖ Changing ‘search’ to ‘found’
❖ Why we do it – the problem
❖ How we do it – the solution
❖ Case study on metadata for video
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Clients
Publishing &
Media
Education
Government
Non-profits
& Societies
Health/Pharma
Manufacturing
& Retail
Promising
solutions for
improving the
accuracy of
content
metadata
❖ Standards - Check out the NISO Library on their
web site
❖ Consortiums for clean data
❖ Share, check, and enhance metadata
❖ Automate as much of manuscript submission &
peer review as possible
❖ Clean up the author synonymy
❖ Enhance your content and the audiences it
represents worldwide
❖ THE GOAL
❖ High integrity, accurate, consistent content
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Semantic control and content
enrichment
❖ Controlled vocabularies, authority files,
taxonomies, thesaurus, ontologies, triple
stores, and knowledge graphs
❖ Follow the standards
▪ Accepted Structure and Format Use
• ANSI/NISO Z39.19
• ISO2788
• BS5723
• ISO25964 Parts 1 and 2
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Every Walk of Life Uses
Constantly Changing Vernacular
• Homeless
• Unsheltered
• Unhoused
• Street people
• Hobos
• Vagrants
• ….
• Taxonomy
• Ontology
• Thesaurus
• Knowledge Map
• Metastatic breast
cancer
• Stage IV Breast
Cancer
• Invasive Breast
Cancer
• Covid-19
• Coronavirus
• SARS-CoV-2
• Omicron
• BA.4
• BA.5
• …..
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Differences in search results due to synonymy
❖ Invasive breast cancer: 520 results
❖ Metastatic breast cancer: 1803 results
❖ Stage IV breast cancer: 73 results
❖ Stage IV breast cancer: 46,400,000 results
Lack of Synonymy Control
Breaks Search
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
But How About Improving the Content Itself?
❖ Series of Metadata Filters and Enrichment
▪ The varying names used in the content of the publication
▪ Gene names – 19 or more synonyms per name
▪ Medicinal Plant names – nearly 17 synonyms per name
▪ Bad Cell Line references
▪ Suspect Science topics / Fake news
❖ Semantic enrichment supports metadata and search
❖ Time savings for researchers both authors and readers
❖ It allows the disambiguated information in the formation of a
platform for better science
❖ Being able to reference a widely available authoritative source is
crucial to all world health
Atypon
Production
How is it done
10/11/2022
Provisional
Acceptance
Article
Submission
Revision
Review –
Link to
Portal
Web based
Deputy
Editor Key
Term
Review
Portal
Review, add,
delete, submit
Key Term
update in
article XML
New Taxonomy
terms
New
Taxonomy
updated
SKOS file
Accept
Article XML
Concept
Taxonomy
MPNS
Name
verification
Taxogene
Human
Genome
Tagging
Suspect
Science
Filter
Bad Cell Lines
Identification
SciGen
Identification
After Todd Ware of ACP
ICD_10
CPT
HCPCS
Coding
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
TaxoGene
❖ Automatically find all synonyms and insert the
consensus approved name.
❖ Special characters and extensions
❖ Directing all readers to the preferred name in
either search or publication allows
full retrieval recall of related material
insures precision in search
remove ambiguity in communication
10/11/2022
• Synonymy: Average of 19 synonyms per
gene name
• Sources:
• Human Genome Project
• https://www.ncbi.nlm.nih.gov/genome/guide/
human/
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Medicinal Plant Names
Service (MPNS)
❖ How many kinds of “Ginger” are there??
▪ At least 42
❖ Better communication between researchers worldwide – no misidentification
❖ Link to full plant name record at MPNS
❖ Includes all known scientific names, common names, homonyms,
and more
❖ Global coverage – not just regional which is important an
integrated world
❖ Constantly updated and linked to the
▪ Kew International Plants Names Database.
▪ International Plant Names Index (IPNI)
10/11/2022
• Source Data: The Royal Botanical Gardens at Kew
• Synonymy: Average of over 16 names per plant
are used.
• Includes all known scientific names, common
names, homonyms, and more
• www.kew.org/mpns
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
❖ ICLAC - There are about 437 cell lines, which are documented as such
and we mine those to highlight misuse….
❖ List of known contaminated cell lines (many of them invaded by HeLa
cells)
❖ Don’t let your authors and researchers work with known bad data.
▪ Over 32,000 papers that have worked on the wrong cells
▪ Cited by at least 500,000 more articles,
▪ https://blogs.sciencemag.org/pipeline/archives/2017/10/20/bad-
cells-so-many-bad-cells
10/11/2022
Sources: 488 from ICLAC
https://iclac.org/databases/cross-contaminations/
757 from Swiss Institute of Bioinformatics (SIB)
https://en.wikipedia.org/wiki/Cellosaurus
https://en.wikipedia.org/wiki/List_of_contaminated_c
ell_lines
Offering:
A rule base to quickly verify that
the cell lines used are valid and
not a contaminated line
Bad Cell Lines
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Suspect Science Filter
❖ List of topics which require a closer look by acquisitions
editors before sending out to potential peer reviewers
❖ Identifies questionable articles
❖ Autism and vaccination
▪ Flag for assessment before sending to peer review
❖ Saves time in acquisitions review
❖ Auto Identify at time of submission using a rule base
10/11/2022
Source: PLOS in conjunction with Access Innovations
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Access Integrity Coding
❖ Medical Coding – automatically for articles and reports
etc.
❖ ICD-10 The international Classification of Diseases.
!78,000 codes to give full details to medical
professionals on where that article or report falls within
medical diagnosis and procedures.
❖ CPT from the American Medical Association for
Classification of Procedures and Techniques
❖ HCPCS also from the AMA to find the illusive materials
and supplies needed to support this item described.
10/11/2022
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Taxonomy Links in the PLOS Editorial Workflow
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Increasing Audio Video Access
❖ The new horizon is indexing audio
video content to make it accessible
▪ Conference proceedings
▪ Demonstrations, interviews online
▪ Lab experiments
❖ All disappear without tagging of the
content
❖ Metadata without subject metadata
does not give you access to the
content (What was that about?)
❖ Add taxonomy terms to the audio layer
using transcription via auto tagging
▪ USPTO case study
❖ VATT™ – video to text and tagging
from Data Harmony®
© 2010. Access Innovations, Inc. All Rights Reserved.
© 2022. Access Innovations, Inc. All Rights Reserved.
Fiscal Impacts From
Semantically Enriching your
Content
❖ MUST use at both input and search
❖ 34% improvement in search
▪ With just semantic enrichment
▪ Ying-Hsang Liu , DC 2016, Copenhagen, Denmark
❖ 75% higher book sales with more complete metadata
▪ NIELSEN BOOK US STUDY: THE IMPORTANCE OF METADATA FOR
DISCOVERABILITY AND SALES ,
▪ David, Senior Director, Client Solutions, Nielsen Book’s Research and
Commerce Solutions Published in the US December 31, 2016
Metadata is
the Key!
© 2022. Access Innovations, Inc. All rights reserved.
Access Innovations, Inc.
Marjorie M.K. Hlava
mhlava@accessinn.com
Jay Ven Eman
j_ven_eman@accessinn.com
www.accessinn.com
www.dataharmony.com
+1.505.998.0800
Albuquerque, NM
Leveraging Your Content
Semantically
Where’s the on about…
Looney Tunes® Revisited
October 10, 2022
Thank you!

More Related Content

Similar to AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA)

Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Susanna-Assunta Sansone
 
2. ratner orcid getting to launch v5
2. ratner orcid getting to launch v52. ratner orcid getting to launch v5
2. ratner orcid getting to launch v5
ORCID, Inc
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lee Dirks
 

Similar to AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA) (20)

AlexanderStreet_17April2015
AlexanderStreet_17April2015AlexanderStreet_17April2015
AlexanderStreet_17April2015
 
SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
ORCID Implementation in Open Access Repositories and Institutional Research I...
ORCID Implementation in Open Access Repositories and Institutional Research I...ORCID Implementation in Open Access Repositories and Institutional Research I...
ORCID Implementation in Open Access Repositories and Institutional Research I...
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...
 
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
Metadata Ownership & Metadata Rights
Metadata Ownership & Metadata RightsMetadata Ownership & Metadata Rights
Metadata Ownership & Metadata Rights
 
2. ratner orcid getting to launch v5
2. ratner orcid getting to launch v52. ratner orcid getting to launch v5
2. ratner orcid getting to launch v5
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
 
Qualifying Online Information Resources for Chemists
Qualifying Online Information Resources for ChemistsQualifying Online Information Resources for Chemists
Qualifying Online Information Resources for Chemists
 
Thinking about resource issues: copyright and open access
Thinking about resource issues: copyright and open accessThinking about resource issues: copyright and open access
Thinking about resource issues: copyright and open access
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
 

More from Dr. Haxel Consult

AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
Dr. Haxel Consult
 
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
Dr. Haxel Consult
 

More from Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
The Artificial Intelligence Conference on Search, Data and Text Mining, Analy...
 

Recently uploaded

audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
lolsDocherty
 
Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
ChloeMeadows1
 

Recently uploaded (17)

Thank You Luv I’ll Never Walk Alone Again T shirts
Thank You Luv I’ll Never Walk Alone Again T shirtsThank You Luv I’ll Never Walk Alone Again T shirts
Thank You Luv I’ll Never Walk Alone Again T shirts
 
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
 
Bug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's GuideBug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's Guide
 
AI Generated 3D Models | AI 3D Model Generator
AI Generated 3D Models | AI 3D Model GeneratorAI Generated 3D Models | AI 3D Model Generator
AI Generated 3D Models | AI 3D Model Generator
 
Statistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdfStatistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdf
 
Development Lifecycle.pptx for the secure development of apps
Development Lifecycle.pptx for the secure development of appsDevelopment Lifecycle.pptx for the secure development of apps
Development Lifecycle.pptx for the secure development of apps
 
I’ll See Y’All Motherfuckers In Game 7 Shirt
I’ll See Y’All Motherfuckers In Game 7 ShirtI’ll See Y’All Motherfuckers In Game 7 Shirt
I’ll See Y’All Motherfuckers In Game 7 Shirt
 
Premier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdfPremier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdf
 
Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
 
TORTOGEL TELAH MENJADI SALAH SATU PLATFORM PERMAINAN PALING FAVORIT.
TORTOGEL TELAH MENJADI SALAH SATU PLATFORM PERMAINAN PALING FAVORIT.TORTOGEL TELAH MENJADI SALAH SATU PLATFORM PERMAINAN PALING FAVORIT.
TORTOGEL TELAH MENJADI SALAH SATU PLATFORM PERMAINAN PALING FAVORIT.
 
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital PresenceCyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
 
Reggie miller choke t shirtsReggie miller choke t shirts
Reggie miller choke t shirtsReggie miller choke t shirtsReggie miller choke t shirtsReggie miller choke t shirts
Reggie miller choke t shirtsReggie miller choke t shirts
 
GOOGLE Io 2024 At takes center stage.pdf
GOOGLE Io 2024 At takes center stage.pdfGOOGLE Io 2024 At takes center stage.pdf
GOOGLE Io 2024 At takes center stage.pdf
 
The Rise of Subscription-Based Digital Services.pdf
The Rise of Subscription-Based Digital Services.pdfThe Rise of Subscription-Based Digital Services.pdf
The Rise of Subscription-Based Digital Services.pdf
 
Registry Data Accuracy Improvements, presented by Chimi Dorji at SANOG 41 / I...
Registry Data Accuracy Improvements, presented by Chimi Dorji at SANOG 41 / I...Registry Data Accuracy Improvements, presented by Chimi Dorji at SANOG 41 / I...
Registry Data Accuracy Improvements, presented by Chimi Dorji at SANOG 41 / I...
 
iThome_CYBERSEC2024_Drive_Into_the_DarkWeb
iThome_CYBERSEC2024_Drive_Into_the_DarkWebiThome_CYBERSEC2024_Drive_Into_the_DarkWeb
iThome_CYBERSEC2024_Drive_Into_the_DarkWeb
 
Free scottie t shirts Free scottie t shirts
Free scottie t shirts Free scottie t shirtsFree scottie t shirts Free scottie t shirts
Free scottie t shirts Free scottie t shirts
 

AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA)

  • 1. © 2022. Access Innovations, Inc. All rights reserved. Access Innovations, Inc. Marjorie M.K. Hlava mhlava@accessinn.com Jay Ven Eman j_ven_eman@accessinn.com www.accessinn.com www.dataharmony.com +1.505.998.0800 Albuquerque, NM Leveraging Your Content Semantically Where’s the one about… Looney Tunes® Revisited October 10, 2022
  • 3. How do you find information… when you don’t know what you want, what it might be called, where to look? I used to wander the stacks…
  • 5.
  • 6.
  • 8. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Albuquerque, New Mexico Las Vegas, New Mexico Las Vegas, Nevada
  • 9.
  • 11. What if there is sparse metadata unlike library catalog cards? Video? - Notorious for no metadata - Maybe a title - Newspaper ‘slug’
  • 12. Where’s the one about… Daffy Duck and Donald Duck and pianos?
  • 13. What was the one about… Bugs Bunny - opera singer? © Warner Bros.
  • 14. Do you recall the one about… a coyote and the what was it? © Warner Bros.
  • 15. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Our neighborhood Coyote
  • 17. Questions to ponder ❖ How do you find what you’re looking for? ❖ How do you know what you want? ❖ How do you know you found it? ❖ How do you know, if you’ve missed something? ❖ How do you replicate wandering the stacks in the Age of Google?
  • 18. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. And now… Case studies by Marjorie Hlava
  • 19. Case Study ❖ Access Innovations, Inc. ❖ Changing ‘search’ to ‘found’ ❖ Why we do it – the problem ❖ How we do it – the solution ❖ Case study on metadata for video
  • 20. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Clients Publishing & Media Education Government Non-profits & Societies Health/Pharma Manufacturing & Retail
  • 21. Promising solutions for improving the accuracy of content metadata ❖ Standards - Check out the NISO Library on their web site ❖ Consortiums for clean data ❖ Share, check, and enhance metadata ❖ Automate as much of manuscript submission & peer review as possible ❖ Clean up the author synonymy ❖ Enhance your content and the audiences it represents worldwide ❖ THE GOAL ❖ High integrity, accurate, consistent content
  • 22. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Semantic control and content enrichment ❖ Controlled vocabularies, authority files, taxonomies, thesaurus, ontologies, triple stores, and knowledge graphs ❖ Follow the standards ▪ Accepted Structure and Format Use • ANSI/NISO Z39.19 • ISO2788 • BS5723 • ISO25964 Parts 1 and 2
  • 23. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Every Walk of Life Uses Constantly Changing Vernacular • Homeless • Unsheltered • Unhoused • Street people • Hobos • Vagrants • …. • Taxonomy • Ontology • Thesaurus • Knowledge Map • Metastatic breast cancer • Stage IV Breast Cancer • Invasive Breast Cancer • Covid-19 • Coronavirus • SARS-CoV-2 • Omicron • BA.4 • BA.5 • …..
  • 24. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Differences in search results due to synonymy ❖ Invasive breast cancer: 520 results ❖ Metastatic breast cancer: 1803 results ❖ Stage IV breast cancer: 73 results ❖ Stage IV breast cancer: 46,400,000 results Lack of Synonymy Control Breaks Search
  • 25. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. But How About Improving the Content Itself? ❖ Series of Metadata Filters and Enrichment ▪ The varying names used in the content of the publication ▪ Gene names – 19 or more synonyms per name ▪ Medicinal Plant names – nearly 17 synonyms per name ▪ Bad Cell Line references ▪ Suspect Science topics / Fake news ❖ Semantic enrichment supports metadata and search ❖ Time savings for researchers both authors and readers ❖ It allows the disambiguated information in the formation of a platform for better science ❖ Being able to reference a widely available authoritative source is crucial to all world health
  • 26. Atypon Production How is it done 10/11/2022 Provisional Acceptance Article Submission Revision Review – Link to Portal Web based Deputy Editor Key Term Review Portal Review, add, delete, submit Key Term update in article XML New Taxonomy terms New Taxonomy updated SKOS file Accept Article XML Concept Taxonomy MPNS Name verification Taxogene Human Genome Tagging Suspect Science Filter Bad Cell Lines Identification SciGen Identification After Todd Ware of ACP ICD_10 CPT HCPCS Coding
  • 27. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. TaxoGene ❖ Automatically find all synonyms and insert the consensus approved name. ❖ Special characters and extensions ❖ Directing all readers to the preferred name in either search or publication allows full retrieval recall of related material insures precision in search remove ambiguity in communication 10/11/2022 • Synonymy: Average of 19 synonyms per gene name • Sources: • Human Genome Project • https://www.ncbi.nlm.nih.gov/genome/guide/ human/
  • 28. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Medicinal Plant Names Service (MPNS) ❖ How many kinds of “Ginger” are there?? ▪ At least 42 ❖ Better communication between researchers worldwide – no misidentification ❖ Link to full plant name record at MPNS ❖ Includes all known scientific names, common names, homonyms, and more ❖ Global coverage – not just regional which is important an integrated world ❖ Constantly updated and linked to the ▪ Kew International Plants Names Database. ▪ International Plant Names Index (IPNI) 10/11/2022 • Source Data: The Royal Botanical Gardens at Kew • Synonymy: Average of over 16 names per plant are used. • Includes all known scientific names, common names, homonyms, and more • www.kew.org/mpns
  • 29. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. ❖ ICLAC - There are about 437 cell lines, which are documented as such and we mine those to highlight misuse…. ❖ List of known contaminated cell lines (many of them invaded by HeLa cells) ❖ Don’t let your authors and researchers work with known bad data. ▪ Over 32,000 papers that have worked on the wrong cells ▪ Cited by at least 500,000 more articles, ▪ https://blogs.sciencemag.org/pipeline/archives/2017/10/20/bad- cells-so-many-bad-cells 10/11/2022 Sources: 488 from ICLAC https://iclac.org/databases/cross-contaminations/ 757 from Swiss Institute of Bioinformatics (SIB) https://en.wikipedia.org/wiki/Cellosaurus https://en.wikipedia.org/wiki/List_of_contaminated_c ell_lines Offering: A rule base to quickly verify that the cell lines used are valid and not a contaminated line Bad Cell Lines
  • 30. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Suspect Science Filter ❖ List of topics which require a closer look by acquisitions editors before sending out to potential peer reviewers ❖ Identifies questionable articles ❖ Autism and vaccination ▪ Flag for assessment before sending to peer review ❖ Saves time in acquisitions review ❖ Auto Identify at time of submission using a rule base 10/11/2022 Source: PLOS in conjunction with Access Innovations
  • 31. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Access Integrity Coding ❖ Medical Coding – automatically for articles and reports etc. ❖ ICD-10 The international Classification of Diseases. !78,000 codes to give full details to medical professionals on where that article or report falls within medical diagnosis and procedures. ❖ CPT from the American Medical Association for Classification of Procedures and Techniques ❖ HCPCS also from the AMA to find the illusive materials and supplies needed to support this item described. 10/11/2022
  • 32. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Taxonomy Links in the PLOS Editorial Workflow
  • 33. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Increasing Audio Video Access ❖ The new horizon is indexing audio video content to make it accessible ▪ Conference proceedings ▪ Demonstrations, interviews online ▪ Lab experiments ❖ All disappear without tagging of the content ❖ Metadata without subject metadata does not give you access to the content (What was that about?) ❖ Add taxonomy terms to the audio layer using transcription via auto tagging ▪ USPTO case study ❖ VATT™ – video to text and tagging from Data Harmony®
  • 34. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Fiscal Impacts From Semantically Enriching your Content ❖ MUST use at both input and search ❖ 34% improvement in search ▪ With just semantic enrichment ▪ Ying-Hsang Liu , DC 2016, Copenhagen, Denmark ❖ 75% higher book sales with more complete metadata ▪ NIELSEN BOOK US STUDY: THE IMPORTANCE OF METADATA FOR DISCOVERABILITY AND SALES , ▪ David, Senior Director, Client Solutions, Nielsen Book’s Research and Commerce Solutions Published in the US December 31, 2016
  • 36. © 2022. Access Innovations, Inc. All rights reserved. Access Innovations, Inc. Marjorie M.K. Hlava mhlava@accessinn.com Jay Ven Eman j_ven_eman@accessinn.com www.accessinn.com www.dataharmony.com +1.505.998.0800 Albuquerque, NM Leveraging Your Content Semantically Where’s the on about… Looney Tunes® Revisited October 10, 2022 Thank you!