SlideShare a Scribd company logo
1 of 26
Download to read offline
IMPLEMENTING
SEARCH ENHANCEMENTS
Managing Information Overload
Markus Kaindl
26 August 2020
NISO Virtual Conference
Topic: Transforming Search
Your Speaker
on LinkedIn.
As a father of two and tech
enthusiast I fully embrace
knowledge dissemination.
2
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
1 Information Overload Challenges
2 COVID-19 - From Report to App
3 TL;DR Extreme Summarization
4 Natural Language Generation
5 Intelligent Question Answering
6 Wrap-Up: Re-Imagining Re-Search
CONTENTS
3
INFORMATION
OVERLOAD
CHALLENGES
1
4
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
OUR BRANDS PUBLISH
A LOT OF RESEARCH!
OUR RESEARCH IMPRINTS
5
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
MANAGING THE INFORMATION OVERLOAD
Addressing it in our role as a publisher in various ways.
ANSWER
PUBLISHING
QUESTIONS
● Engage with our communities
● Shape the future of publishing
HELP
RESEARCHERS
● Save time entering new field
● Save time staying up tp date
● Help to overcome research silos
PROVIDE
SERVICE TO
SN AUTHORS
● Instant literature overview
● Automated ToC generation
● Text generation
STRUCTURE
EXISTING
CONTENT
● Develop and offer new products
like machine-generated books,
reports or applications
● Auto-clustering of subject areas
6
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
New solutions need to tame the coronavirus literature.
“The COVID-19 literature has grown in much
the same way as the disease’s transmission:
exponentially. The NIH’s COVID-19 Portfolio, a
website that tracks papers related to the
SARS-CoV-2 coronavirus and the disease it
causes, lists more than 28,000 articles — far
too many for any researcher to read.”
nature technology feature, 09 June 2020
HOW TO DEAL WITH EXPLOSIVE GROWTH?
Snapshot from https://icite.od.nih.gov/covid19/search/ on 02 July 2020
7
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
Overwhelming reviewers and leaving good research “in an increasingly longer queue”.
World Health Organisation Director-General
Tedros Adhanom Ghebreyesus said, “We’re not
just fighting an epidemic; we’re fighting an
infodemic”. He was referring to the excessive
amount of information surrounding the
COVID-19 pandemic. Data dashboards,
aggregators and charts of all types have formed
the basis of much of what we know about the
pandemic, lending a veneer of legitimacy to
often contradictory or competing claims.
oecd-development-matters.org, 17 June 2020
FIGHTING AN INFODEMIC - HOW TO NAVIGATE INSTEAD OF DROWN?
SCIENCE|BUSINESS News, 25 June 2020
8
COVID-19 - FROM
REPORT TO APP
2
9
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
First step: A simple overview of a range of recent Springer Nature publications using an AI-based report
- Focus on user needs and pain points: Collaboration with virulogists
- Rapid prototype in the early days of experimental and development
This minimal viable product is in its first
iteration a report that groups publications
meaningfully together based on their topic,
provides snippets of text to grasp what they
are about quickly, and then links directly to
the respective Springer Nature publication.
Metrics like platform downloads, citations
and social media mentions are if importance
to measure the reach and potential impact.
Springer Nature's The Source, 01 April 2020
Given the scale and urgency, cutting through the Corona clutter is more important than ever.
HOW CAN AI SUPPORT THE RESEARCH COMMUNITY IN TIMES OF CRISIS?
10
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
BUT...
… would this be just another solution nobody is aware of?
“So far, these sites have received modest traffic. As of late
May, COVID-19 Primer has welcomed 14,000 unique visitors
per month, and SciSight has seen 11,000 since it launched.
COVIDScholar receives about 500 visitors each day, and
COVID-19 KnetMiner has had that many in total. The Kaggle
challenge has received the most attention, with 1.7 million
page views since it launched in mid-March. Of the COVID-19
researchers we contacted, most had not heard of the
majority of these tools. And still more such tools are being
developed around the world, including Vilokana in India and
CovidAsk in South Korea.”
nature technology feature, 09 June 2020
11
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
We leverage a unique combination of
i) strong brand awareness for this community
ii) deep understanding of needs and networks
iii) applicable tech for entity & topic modelling,
author graphs and auto-aummarisation
vi) industry collaborations (e.g. Digital Science).
Help researchers who need to stay on top by moving from report to an intuitive app.
HOW CAN SN UNIQUELY SUPPORT THE RESEARCH COMMUNITY?
After positive feedback we experiment in various directions in this domain:
● Domain, persona and task specific content recommendations
● Content across all publishers using public CORD-19 dataset
● Reading list, automatic summaries, most important entities
● Most prolific potential collaborators to amplify wayfinders
12
TL;DR EXTREME
SUMMARIZATION
3
13
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
DON’T GIVE ME THE DETAILS, JUST THE SUMMARY?
TLDR as a community acronym stands for “too long; didn’t read”.
BENEFIT
An alternative to abstracts, TLDRs of scientific
papers leave out nonessential background or
methodological details and capture the key
important aspects of the paper, such as its
main contributions.
CHALLENGE
Writing a TLDR of a scientific paper requires
expert background knowledge and complex
domain-specific language understanding to
both identify the salient aspects of the paper
while maintaining faithfulness to the source
and correctness of the written summary AI² preprint (April 2020), GitHub code, live demo
Slide heading: Shashi Narayan et al., ACL 2018
14
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
After trying out the public demo:
• Positive paper author feedback on correctness and usefulness
• Next step is extending this to other domains (training corpora)
Gathering feedback and contemplating a publisher follow-up.
DOES THIS WORK WELL FOR COMPUTER SCIENCE?
Paper Title TLDR Correct
Useful?
(5 very, 1
not at all)
Optimization of Hybrid Electric Drive
System Components in Long-Haul
Vehicles for the Evaluation of
Customer Requirements
An evolutionary optimization
algorithm to optimize the drivetrain
components for a European long-haul
truck concept under consideration of
a highly detailed TCO model.
yes 5
Multi-Objective Optimization of a
Long-Haul Truck Hybrid Operational
Strategy And a Predictive Powertrain
Control System
A rule-based control strategy for a
parallel hybrid topology with
fuel-saving functions and Predictive
Cruise Control.
yes 5
15
NATURAL
LANGUAGE
GENERATION
4
16
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
What are language models?
Simply put, they can predict the most likely next words and
their probabilities given an input primer, similar to Google
query autocomplete. Natural Language Processing (NLP) is
increasingly moving in the direction of creating pre-trained
language models using huge (!) text corpora, which can
later be fine-tuned on specific text domains and tasks.
Possibly the ultimate step to abstractive summarization and way beyond.
WHAT IS THE HYPE AROUND LANGUAGE MODELS ABOUT?
↓Video OpenAIPassing the Turing Test?
Transformers have risen admirably to the challenge of
language generation with many models capable of
generating impressive sequences of text that is not only
grammatically correct but also cohesive and meaningful.
↑Chart towards data science, 31 May 2020
17
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
HOW TO AUTOMATE SCIENTIFIC CONTENT GENERATION?
Apart from technical advances, many challenges and caveats remain.
Trained GPT-2 on ACL reference
corpus (20k papers’ introductions)
DEMO
Augmented / Actionable
Intelligence / Insights
DISCUSSION
|Primer| Springer Nature should publish a computer-generated
book about computational linguistics because... it offers a broad
perspective. Recently, the computational field of Natural Language
Processing has seen an exponential expansion due to the rapidly
growing volume of written and spoken language data, which has
led to interest in the automatic interpretation of natural language
texts. (...) In order to produce such books, we will need a large
number of computer-generated texts. However, such a large
number of texts is not a realistic goal for the foreseeable future.
We believe that it is possible to produce texts of a certain level of
quality that would be of interest to a large audience.
We cannot delete the human in the loop.
- Blackbox: origin of the generated text
- Full transparency and ethics imperative
- Controlling many guided generations
A.I. =
Image on the top right: STM Tech Trends 2024
+ Help researchers to kick-start writing
+ Use this tech for science journalism
+ Dynamically created topical pages
18
INTELLIGENT
QUESTION
ANSWERING
5
19
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
- Language model pre-training captures a surprising amount of
world knowledge, crucial for NLP tasks like question answering.
- Challenge: The knowledge is stored implicitly in the parameters
of a neural network, requiring ever bigger networks (cf. GPT-3)
- REALM: Retrieval-Augmented Language Model Pre-Training.
This allows retrieve and attend over documents from a large
corpus such as Wikipedia, which offers quite technical content.
- After applying this to the open-domain question answering
challenge, this approach outperforms SotA models by 4-16%
- Qualitative benefits such as interpretability and modularity.
Augment language representation with knowledge retriever.
Illustrations from Google’s AI Blog - REALM preprint here
HOW TO MAKE IMPLICIT KNOWLEDGE EXPLICIT?
20
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
- With Data QnA, Google Cloud
now enables any user to
conversationally analyze data
- Data QnA can be embedded
where users work, including
chatbots, spreadsheets, etc.
- How it works:
- Formulate free-form text
analytical questions, with
auto-suggested entities.
- English interpretation, SQL
query and answer returned.
- “Did-you-mean” clarifications if
a question was ambiguous.
Via natural language interfaces for SQL querying.
EMPOWERING NON-TECH AUDIENCE?
Video and additional info: Google blog ‘Ask questions to BigQuery and get instant answers through Data QnA’
21
WRAP-UP:
RE-IMAGINING
RE-SEARCH
6
22
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
HOW CAN WE MAKE (RE)SEARCH A DELIGHTFUL TASK?
Address information overload by summarising rather than filtering.
Google first improved search in the 1990s, but with the
volume of scientific articles filtering to a reasonable set of
relevant papers to review is impossible in some fields.
Summarising content has been a manual exercise, often
delegated to junior researchers to perform a literature
review. Advances in AI can automate summarisation
activities, freeing researchers up to find new discoveries.
Since 2000s search has been augmented with collaborative
filtering on social networks such as ResearchGate, Twitter
and WeChat. This provides researchers with serendipitous
reading, but can miss the content they need.
23
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
WHAT ARE THE KEY TAKEAWAYS FROM THIS?
Latest tech. developments and user expectations fuel this.
Highly
personalized
Specific content recommendations:
- field of research;
- person & role;
- task at hand.
PERSONAL
Automatically
summarized
FOCUSSED
Summaries can be tweaked to fit a
user’s context, domain, expertise and
time - all the way from TL;DRs to
section and full paper summaries.
Easily
accessible
INTUITIVE
Cutting edge information retrieval
and intuitive question answering on
research data tables will increase the
pace of scientific discovery.
Illustrations from Unsplash (left 2 right)
THANK YOU
24
Presentation prepared by
Markus Kaindl
Senior Manager
Data Development
Product and Platform Group
markus.kaindl@springernature.com
25
NISO: Transforming Search | Markus Kaindl | August 26, 2020 |
What are your expectations and worries?
Q & A SESSION AND DISCUSSION

More Related Content

What's hot

Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science suresh sood
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...European Data Forum
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerMichel Dumontier
 
The State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private SectorThe State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private SectorTim Davies
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the futureSlim Turki, Dr.
 
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Connected Data World
 
Big data and health care
 Big data and health care Big data and health care
Big data and health carecjw119
 
Big data and health care
 Big data and health care Big data and health care
Big data and health carecjw119
 
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...Dr. Haxel Consult
 
Isolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’sIsolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’seSAT Journals
 
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...Dr. Haxel Consult
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )IJDKP
 

What's hot (20)

Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Semantic Web vision and its relevance to Open Digital Data for MGI
Semantic Web vision and its relevance to Open Digital Data for MGISemantic Web vision and its relevance to Open Digital Data for MGI
Semantic Web vision and its relevance to Open Digital Data for MGI
 
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private SectorThe State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private Sector
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the future
 
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
Powerful Information Discovery with Big Knowledge Graphs –The Offshore Leaks ...
 
Big data and health care
 Big data and health care Big data and health care
Big data and health care
 
Big data and health care
 Big data and health care Big data and health care
Big data and health care
 
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...
AI-SDV 2020: Processing Artificial Intelligence: Highlighting the Canadian Pa...
 
Isolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’sIsolating values from big data with the help of four v’s
Isolating values from big data with the help of four v’s
 
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )International Journal of Data Mining & Knowledge Management Process ( IJDKP )
International Journal of Data Mining & Knowledge Management Process ( IJDKP )
 

Similar to Kaindl "Managing Information Overload"

Some New Directions in the Economics of AI
Some New Directions in the Economics of AISome New Directions in the Economics of AI
Some New Directions in the Economics of AIJuan Mateos-Garcia
 
Digital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveDigital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveChristopher Brown
 
NLP Workshop Presentation at Universitat de Barcelona
NLP Workshop Presentation at Universitat de BarcelonaNLP Workshop Presentation at Universitat de Barcelona
NLP Workshop Presentation at Universitat de BarcelonaSergiPons5
 
Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Carlo Vaccari
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
 
FutureTDM Workshop II 29 March
FutureTDM Workshop II 29 MarchFutureTDM Workshop II 29 March
FutureTDM Workshop II 29 MarchFutureTDM
 
KNOW4DRR ws_polimi_bolzano_2013_introduction
KNOW4DRR  ws_polimi_bolzano_2013_introductionKNOW4DRR  ws_polimi_bolzano_2013_introduction
KNOW4DRR ws_polimi_bolzano_2013_introductionknow4drr
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 
Intelligent Microcontent: At the Point of Content Convergence | Rob Hanna
Intelligent Microcontent: At the Point of Content Convergence | Rob HannaIntelligent Microcontent: At the Point of Content Convergence | Rob Hanna
Intelligent Microcontent: At the Point of Content Convergence | Rob HannaLavaConConference
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017openminted_eu
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationPhilip Bourne
 
Foresight Methods and Practice: Lessons Learned from International Foresight ...
Foresight Methods and Practice: Lessons Learned from International Foresight ...Foresight Methods and Practice: Lessons Learned from International Foresight ...
Foresight Methods and Practice: Lessons Learned from International Foresight ...Totti Könnölä
 
Data Science Skills Study 2019 by AIM And Imarticus Learning
Data Science Skills Study 2019 by AIM And Imarticus LearningData Science Skills Study 2019 by AIM And Imarticus Learning
Data Science Skills Study 2019 by AIM And Imarticus LearningPraj H
 
The International Information Conference on Search, Data Mining and Visualiza...
The International Information Conference on Search, Data Mining and Visualiza...The International Information Conference on Search, Data Mining and Visualiza...
The International Information Conference on Search, Data Mining and Visualiza...Dr. Haxel Consult
 
Sustainability - The Software Perspective
Sustainability - The Software PerspectiveSustainability - The Software Perspective
Sustainability - The Software PerspectivePatricia Lago
 

Similar to Kaindl "Managing Information Overload" (20)

Some New Directions in the Economics of AI
Some New Directions in the Economics of AISome New Directions in the Economics of AI
Some New Directions in the Economics of AI
 
ICWI_2002 (1).pdf
ICWI_2002 (1).pdfICWI_2002 (1).pdf
ICWI_2002 (1).pdf
 
Digital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspectiveDigital notebooks - a Jisc perspective
Digital notebooks - a Jisc perspective
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
NLP Workshop Presentation at Universitat de Barcelona
NLP Workshop Presentation at Universitat de BarcelonaNLP Workshop Presentation at Universitat de Barcelona
NLP Workshop Presentation at Universitat de Barcelona
 
Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycle
 
FutureTDM Workshop II 29 March
FutureTDM Workshop II 29 MarchFutureTDM Workshop II 29 March
FutureTDM Workshop II 29 March
 
KNOW4DRR ws_polimi_bolzano_2013_introduction
KNOW4DRR  ws_polimi_bolzano_2013_introductionKNOW4DRR  ws_polimi_bolzano_2013_introduction
KNOW4DRR ws_polimi_bolzano_2013_introduction
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 
Faut-il se laisser digitaliser ?
Faut-il se laisser digitaliser ?Faut-il se laisser digitaliser ?
Faut-il se laisser digitaliser ?
 
Intelligent Microcontent: At the Point of Content Convergence | Rob Hanna
Intelligent Microcontent: At the Point of Content Convergence | Rob HannaIntelligent Microcontent: At the Point of Content Convergence | Rob Hanna
Intelligent Microcontent: At the Point of Content Convergence | Rob Hanna
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
Cirad Concours
Cirad ConcoursCirad Concours
Cirad Concours
 
Foresight Methods and Practice: Lessons Learned from International Foresight ...
Foresight Methods and Practice: Lessons Learned from International Foresight ...Foresight Methods and Practice: Lessons Learned from International Foresight ...
Foresight Methods and Practice: Lessons Learned from International Foresight ...
 
Data Science Skills Study 2019 by AIM And Imarticus Learning
Data Science Skills Study 2019 by AIM And Imarticus LearningData Science Skills Study 2019 by AIM And Imarticus Learning
Data Science Skills Study 2019 by AIM And Imarticus Learning
 
The International Information Conference on Search, Data Mining and Visualiza...
The International Information Conference on Search, Data Mining and Visualiza...The International Information Conference on Search, Data Mining and Visualiza...
The International Information Conference on Search, Data Mining and Visualiza...
 
Global alliance for Research and Innovation in Media
Global alliance for Research and Innovation in MediaGlobal alliance for Research and Innovation in Media
Global alliance for Research and Innovation in Media
 
Sustainability - The Software Perspective
Sustainability - The Software PerspectiveSustainability - The Software Perspective
Sustainability - The Software Perspective
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 

Recently uploaded

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 

Recently uploaded (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 

Kaindl "Managing Information Overload"

  • 1. IMPLEMENTING SEARCH ENHANCEMENTS Managing Information Overload Markus Kaindl 26 August 2020 NISO Virtual Conference Topic: Transforming Search
  • 2. Your Speaker on LinkedIn. As a father of two and tech enthusiast I fully embrace knowledge dissemination.
  • 3. 2 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | 1 Information Overload Challenges 2 COVID-19 - From Report to App 3 TL;DR Extreme Summarization 4 Natural Language Generation 5 Intelligent Question Answering 6 Wrap-Up: Re-Imagining Re-Search CONTENTS
  • 5. 4 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | OUR BRANDS PUBLISH A LOT OF RESEARCH! OUR RESEARCH IMPRINTS
  • 6. 5 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | MANAGING THE INFORMATION OVERLOAD Addressing it in our role as a publisher in various ways. ANSWER PUBLISHING QUESTIONS ● Engage with our communities ● Shape the future of publishing HELP RESEARCHERS ● Save time entering new field ● Save time staying up tp date ● Help to overcome research silos PROVIDE SERVICE TO SN AUTHORS ● Instant literature overview ● Automated ToC generation ● Text generation STRUCTURE EXISTING CONTENT ● Develop and offer new products like machine-generated books, reports or applications ● Auto-clustering of subject areas
  • 7. 6 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | New solutions need to tame the coronavirus literature. “The COVID-19 literature has grown in much the same way as the disease’s transmission: exponentially. The NIH’s COVID-19 Portfolio, a website that tracks papers related to the SARS-CoV-2 coronavirus and the disease it causes, lists more than 28,000 articles — far too many for any researcher to read.” nature technology feature, 09 June 2020 HOW TO DEAL WITH EXPLOSIVE GROWTH? Snapshot from https://icite.od.nih.gov/covid19/search/ on 02 July 2020
  • 8. 7 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | Overwhelming reviewers and leaving good research “in an increasingly longer queue”. World Health Organisation Director-General Tedros Adhanom Ghebreyesus said, “We’re not just fighting an epidemic; we’re fighting an infodemic”. He was referring to the excessive amount of information surrounding the COVID-19 pandemic. Data dashboards, aggregators and charts of all types have formed the basis of much of what we know about the pandemic, lending a veneer of legitimacy to often contradictory or competing claims. oecd-development-matters.org, 17 June 2020 FIGHTING AN INFODEMIC - HOW TO NAVIGATE INSTEAD OF DROWN? SCIENCE|BUSINESS News, 25 June 2020
  • 10. 9 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | First step: A simple overview of a range of recent Springer Nature publications using an AI-based report - Focus on user needs and pain points: Collaboration with virulogists - Rapid prototype in the early days of experimental and development This minimal viable product is in its first iteration a report that groups publications meaningfully together based on their topic, provides snippets of text to grasp what they are about quickly, and then links directly to the respective Springer Nature publication. Metrics like platform downloads, citations and social media mentions are if importance to measure the reach and potential impact. Springer Nature's The Source, 01 April 2020 Given the scale and urgency, cutting through the Corona clutter is more important than ever. HOW CAN AI SUPPORT THE RESEARCH COMMUNITY IN TIMES OF CRISIS?
  • 11. 10 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | BUT... … would this be just another solution nobody is aware of? “So far, these sites have received modest traffic. As of late May, COVID-19 Primer has welcomed 14,000 unique visitors per month, and SciSight has seen 11,000 since it launched. COVIDScholar receives about 500 visitors each day, and COVID-19 KnetMiner has had that many in total. The Kaggle challenge has received the most attention, with 1.7 million page views since it launched in mid-March. Of the COVID-19 researchers we contacted, most had not heard of the majority of these tools. And still more such tools are being developed around the world, including Vilokana in India and CovidAsk in South Korea.” nature technology feature, 09 June 2020
  • 12. 11 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | We leverage a unique combination of i) strong brand awareness for this community ii) deep understanding of needs and networks iii) applicable tech for entity & topic modelling, author graphs and auto-aummarisation vi) industry collaborations (e.g. Digital Science). Help researchers who need to stay on top by moving from report to an intuitive app. HOW CAN SN UNIQUELY SUPPORT THE RESEARCH COMMUNITY? After positive feedback we experiment in various directions in this domain: ● Domain, persona and task specific content recommendations ● Content across all publishers using public CORD-19 dataset ● Reading list, automatic summaries, most important entities ● Most prolific potential collaborators to amplify wayfinders
  • 14. 13 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | DON’T GIVE ME THE DETAILS, JUST THE SUMMARY? TLDR as a community acronym stands for “too long; didn’t read”. BENEFIT An alternative to abstracts, TLDRs of scientific papers leave out nonessential background or methodological details and capture the key important aspects of the paper, such as its main contributions. CHALLENGE Writing a TLDR of a scientific paper requires expert background knowledge and complex domain-specific language understanding to both identify the salient aspects of the paper while maintaining faithfulness to the source and correctness of the written summary AI² preprint (April 2020), GitHub code, live demo Slide heading: Shashi Narayan et al., ACL 2018
  • 15. 14 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | After trying out the public demo: • Positive paper author feedback on correctness and usefulness • Next step is extending this to other domains (training corpora) Gathering feedback and contemplating a publisher follow-up. DOES THIS WORK WELL FOR COMPUTER SCIENCE? Paper Title TLDR Correct Useful? (5 very, 1 not at all) Optimization of Hybrid Electric Drive System Components in Long-Haul Vehicles for the Evaluation of Customer Requirements An evolutionary optimization algorithm to optimize the drivetrain components for a European long-haul truck concept under consideration of a highly detailed TCO model. yes 5 Multi-Objective Optimization of a Long-Haul Truck Hybrid Operational Strategy And a Predictive Powertrain Control System A rule-based control strategy for a parallel hybrid topology with fuel-saving functions and Predictive Cruise Control. yes 5
  • 17. 16 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | What are language models? Simply put, they can predict the most likely next words and their probabilities given an input primer, similar to Google query autocomplete. Natural Language Processing (NLP) is increasingly moving in the direction of creating pre-trained language models using huge (!) text corpora, which can later be fine-tuned on specific text domains and tasks. Possibly the ultimate step to abstractive summarization and way beyond. WHAT IS THE HYPE AROUND LANGUAGE MODELS ABOUT? ↓Video OpenAIPassing the Turing Test? Transformers have risen admirably to the challenge of language generation with many models capable of generating impressive sequences of text that is not only grammatically correct but also cohesive and meaningful. ↑Chart towards data science, 31 May 2020
  • 18. 17 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | HOW TO AUTOMATE SCIENTIFIC CONTENT GENERATION? Apart from technical advances, many challenges and caveats remain. Trained GPT-2 on ACL reference corpus (20k papers’ introductions) DEMO Augmented / Actionable Intelligence / Insights DISCUSSION |Primer| Springer Nature should publish a computer-generated book about computational linguistics because... it offers a broad perspective. Recently, the computational field of Natural Language Processing has seen an exponential expansion due to the rapidly growing volume of written and spoken language data, which has led to interest in the automatic interpretation of natural language texts. (...) In order to produce such books, we will need a large number of computer-generated texts. However, such a large number of texts is not a realistic goal for the foreseeable future. We believe that it is possible to produce texts of a certain level of quality that would be of interest to a large audience. We cannot delete the human in the loop. - Blackbox: origin of the generated text - Full transparency and ethics imperative - Controlling many guided generations A.I. = Image on the top right: STM Tech Trends 2024 + Help researchers to kick-start writing + Use this tech for science journalism + Dynamically created topical pages
  • 20. 19 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | - Language model pre-training captures a surprising amount of world knowledge, crucial for NLP tasks like question answering. - Challenge: The knowledge is stored implicitly in the parameters of a neural network, requiring ever bigger networks (cf. GPT-3) - REALM: Retrieval-Augmented Language Model Pre-Training. This allows retrieve and attend over documents from a large corpus such as Wikipedia, which offers quite technical content. - After applying this to the open-domain question answering challenge, this approach outperforms SotA models by 4-16% - Qualitative benefits such as interpretability and modularity. Augment language representation with knowledge retriever. Illustrations from Google’s AI Blog - REALM preprint here HOW TO MAKE IMPLICIT KNOWLEDGE EXPLICIT?
  • 21. 20 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | - With Data QnA, Google Cloud now enables any user to conversationally analyze data - Data QnA can be embedded where users work, including chatbots, spreadsheets, etc. - How it works: - Formulate free-form text analytical questions, with auto-suggested entities. - English interpretation, SQL query and answer returned. - “Did-you-mean” clarifications if a question was ambiguous. Via natural language interfaces for SQL querying. EMPOWERING NON-TECH AUDIENCE? Video and additional info: Google blog ‘Ask questions to BigQuery and get instant answers through Data QnA’
  • 23. 22 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | HOW CAN WE MAKE (RE)SEARCH A DELIGHTFUL TASK? Address information overload by summarising rather than filtering. Google first improved search in the 1990s, but with the volume of scientific articles filtering to a reasonable set of relevant papers to review is impossible in some fields. Summarising content has been a manual exercise, often delegated to junior researchers to perform a literature review. Advances in AI can automate summarisation activities, freeing researchers up to find new discoveries. Since 2000s search has been augmented with collaborative filtering on social networks such as ResearchGate, Twitter and WeChat. This provides researchers with serendipitous reading, but can miss the content they need.
  • 24. 23 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | WHAT ARE THE KEY TAKEAWAYS FROM THIS? Latest tech. developments and user expectations fuel this. Highly personalized Specific content recommendations: - field of research; - person & role; - task at hand. PERSONAL Automatically summarized FOCUSSED Summaries can be tweaked to fit a user’s context, domain, expertise and time - all the way from TL;DRs to section and full paper summaries. Easily accessible INTUITIVE Cutting edge information retrieval and intuitive question answering on research data tables will increase the pace of scientific discovery. Illustrations from Unsplash (left 2 right)
  • 25. THANK YOU 24 Presentation prepared by Markus Kaindl Senior Manager Data Development Product and Platform Group markus.kaindl@springernature.com
  • 26. 25 NISO: Transforming Search | Markus Kaindl | August 26, 2020 | What are your expectations and worries? Q & A SESSION AND DISCUSSION