This document summarizes Lora Aroyo's presentation on enabling digital humanities. The presentation discusses expanding human cognition with technology, teaching machines through crowdsourcing diverse interpretations, and engaging users through novel interfaces like event-based browsing of linked historical media. The goals are to support interpretation, gather human semantics, and allow natural interaction. Challenges and the road ahead are also outlined.
"It's 2010: 20 Technologies to Watch, and How to Cope" for SLA's Click University. The real secret is that the best way to cope is to remain positive and reframe our perception of the changes. They don't happen to us. We make them happen. Create the future. Who is better qualified to help invent the information and knowledge based economy than us?
Changing contexts: museums, audiences and technologyMia
A presentation for the International Training Programme run by the British Museum for museum professionals from around the world. This is based on a presentation I prepared for OpenCulture 2011, but includes additional material on mobile phones/devices including the 'Hidden Histories' pilot.
Slides from Adobe webinar, July 17, 2013. Complete recording of the webinar at http://adobe.ly/15NgqPx
Paradoxically, while mobile devices are proliferating, our habits are becoming ever more sedentary; Internet is not only a communication tool but a creator of new spaces that we occupy without moving. In this new virtual universe, space has different dimensions and constraints, which users are learning to understand for personal and professional development
"It's 2010: 20 Technologies to Watch, and How to Cope" for SLA's Click University. The real secret is that the best way to cope is to remain positive and reframe our perception of the changes. They don't happen to us. We make them happen. Create the future. Who is better qualified to help invent the information and knowledge based economy than us?
Changing contexts: museums, audiences and technologyMia
A presentation for the International Training Programme run by the British Museum for museum professionals from around the world. This is based on a presentation I prepared for OpenCulture 2011, but includes additional material on mobile phones/devices including the 'Hidden Histories' pilot.
Slides from Adobe webinar, July 17, 2013. Complete recording of the webinar at http://adobe.ly/15NgqPx
Paradoxically, while mobile devices are proliferating, our habits are becoming ever more sedentary; Internet is not only a communication tool but a creator of new spaces that we occupy without moving. In this new virtual universe, space has different dimensions and constraints, which users are learning to understand for personal and professional development
Makerspaces: a great opportunity to enhance academic libraries, Stellenbosch...Fers
Presentation at Stellenbosch University 14th Annual Library Symposium
Stellenbosch Institute for Advance Studies (STIAS)
November 3, 2016 – November 4, 2016
http://conferences.sun.ac.za/index.php/sulis_symp14/SUALS14
Shaping the academic library of the future: adapt, empower, partner, engage
The tradition of the Stellenbosch University Annual Library Symposium of being a platform for discussing new library and information services and developments will continue at the 14th Annual Library Symposium in November 2016. The discussion will be turned to the shaping of the academic library of the future. Emphasis will be placed on the importance of adapting to users’ needs, adapting to new roles as librarians, adapting as a library, empowerment of staff and clients and the importance of partnerships and engagement.
The theme is partly based on the recent OCLC report, Shaping the Library to the life of the user: adapting, empowering, partnering, engaging. In this report it becomes clear that “research and learning needs are changing. Higher education is reconfiguring. As a result of these massive changes, the library must pivot and adapt”. The following core themes are mentioned in this report: to empower users, to empower the library, form partnerships and to engage the campus community. The report advises libraries to be able to “move from offering a fixed set of services to a ‘constant beta’ mode of service evolution” .
Three different sessions will focus on the following themes, all related to the shaping of the academic library of the future:
Digital innovation: Topics of this session may include Augmented Reality, Artificial Intelligence, Research Data Management, Big Data, E-learning, Digital Humanities, Next Generation Systems.
Collaboration / Partnerships : Partnerships with OCLC, SANLIC and other consortia as well as other collaboration possibilities will be discussed.
The user experience: Academic staff, researchers and students of local universities will share their expectations as users of academic libraries.
Learning takes place in a variety of settings as an ongoing process of skills and knowledge development in changing contexts. With the growing popularity of technology-enhanced learning initiatives, we make the case for more flexible methods for skills and knowledge recognition. The challenge is to create more versatile ways of recognizing uncertified forms of learning – both for formal qualifications of informal learning as well as wider social recognition of uncertified knowledge.
Makerspaces in Bibliotheken, OBA, 31 oktober 2016Fers
Presentatie bij Waag Society over bibliotheeklabs, bestemd voor medewerkers van de Openbare Bibliotheek Amsterdam. Zij beginnen aan een traject waarbij meerdere labs in Amsterdamse bibliotheken worden geïmplementeerd.
Shift to the future – the technological disruption of learning and work - cio...Brian Kuhn
We live in unprecedented times... technology is disrupting learning and work as we know it. Machines are taking over more and more jobs and we need to be preparing this generation for a very uncertain unpredictable rapidly changing world. Education needs to shift to a world of abundant knowledge, leveraged through networks, and knowledge engines. CIO's need to be futurists and guides for their organizations to lead them forward into the future.
My "distinguished speaker" presentation for the global online Library 2.013 conference.
"Library Futures & Tech Directions"
By Joe Murphy, Librarian. Director, Library Futures at Innovative Interfaces, Inc.
The #svegliamuseo project and the concept of a network of digital communicati...#svegliamuseo
On the occasion of the International Conference of Information Technologies for Epigraphy and Digital Cultural Heritage in the Ancient World (EAGLE 2014 , September, 29 – October 1st ), #svegliamuseo took part to the session ‘Who cares? Users, epigraphy and the social web’ and presented the work done for the creation of a strong community rotating around the topic of digital communication and involving as many museum professionals as possible.
EUScreen XL 2014 Conference: DIVE In Digital HermeneuticsLora Aroyo
My talk on "The Event-based Browsing Of Linked Historical Media" @EUScreenXL 2014 Conference
http://dive.beeldengeluid.nl
http://blog.euscreen.eu/archives/5607
Makerspaces: a great opportunity to enhance academic libraries, Stellenbosch...Fers
Presentation at Stellenbosch University 14th Annual Library Symposium
Stellenbosch Institute for Advance Studies (STIAS)
November 3, 2016 – November 4, 2016
http://conferences.sun.ac.za/index.php/sulis_symp14/SUALS14
Shaping the academic library of the future: adapt, empower, partner, engage
The tradition of the Stellenbosch University Annual Library Symposium of being a platform for discussing new library and information services and developments will continue at the 14th Annual Library Symposium in November 2016. The discussion will be turned to the shaping of the academic library of the future. Emphasis will be placed on the importance of adapting to users’ needs, adapting to new roles as librarians, adapting as a library, empowerment of staff and clients and the importance of partnerships and engagement.
The theme is partly based on the recent OCLC report, Shaping the Library to the life of the user: adapting, empowering, partnering, engaging. In this report it becomes clear that “research and learning needs are changing. Higher education is reconfiguring. As a result of these massive changes, the library must pivot and adapt”. The following core themes are mentioned in this report: to empower users, to empower the library, form partnerships and to engage the campus community. The report advises libraries to be able to “move from offering a fixed set of services to a ‘constant beta’ mode of service evolution” .
Three different sessions will focus on the following themes, all related to the shaping of the academic library of the future:
Digital innovation: Topics of this session may include Augmented Reality, Artificial Intelligence, Research Data Management, Big Data, E-learning, Digital Humanities, Next Generation Systems.
Collaboration / Partnerships : Partnerships with OCLC, SANLIC and other consortia as well as other collaboration possibilities will be discussed.
The user experience: Academic staff, researchers and students of local universities will share their expectations as users of academic libraries.
Learning takes place in a variety of settings as an ongoing process of skills and knowledge development in changing contexts. With the growing popularity of technology-enhanced learning initiatives, we make the case for more flexible methods for skills and knowledge recognition. The challenge is to create more versatile ways of recognizing uncertified forms of learning – both for formal qualifications of informal learning as well as wider social recognition of uncertified knowledge.
Makerspaces in Bibliotheken, OBA, 31 oktober 2016Fers
Presentatie bij Waag Society over bibliotheeklabs, bestemd voor medewerkers van de Openbare Bibliotheek Amsterdam. Zij beginnen aan een traject waarbij meerdere labs in Amsterdamse bibliotheken worden geïmplementeerd.
Shift to the future – the technological disruption of learning and work - cio...Brian Kuhn
We live in unprecedented times... technology is disrupting learning and work as we know it. Machines are taking over more and more jobs and we need to be preparing this generation for a very uncertain unpredictable rapidly changing world. Education needs to shift to a world of abundant knowledge, leveraged through networks, and knowledge engines. CIO's need to be futurists and guides for their organizations to lead them forward into the future.
My "distinguished speaker" presentation for the global online Library 2.013 conference.
"Library Futures & Tech Directions"
By Joe Murphy, Librarian. Director, Library Futures at Innovative Interfaces, Inc.
The #svegliamuseo project and the concept of a network of digital communicati...#svegliamuseo
On the occasion of the International Conference of Information Technologies for Epigraphy and Digital Cultural Heritage in the Ancient World (EAGLE 2014 , September, 29 – October 1st ), #svegliamuseo took part to the session ‘Who cares? Users, epigraphy and the social web’ and presented the work done for the creation of a strong community rotating around the topic of digital communication and involving as many museum professionals as possible.
EUScreen XL 2014 Conference: DIVE In Digital HermeneuticsLora Aroyo
My talk on "The Event-based Browsing Of Linked Historical Media" @EUScreenXL 2014 Conference
http://dive.beeldengeluid.nl
http://blog.euscreen.eu/archives/5607
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Lora Aroyo
Presentation at the "Past, Present and Future of Digital Humanities & Social Sciences in the Netherlands" event, http://www.ehumanities.nl/past-present-and-future-of-digital-humanities-social-sciences-in-the-netherlands-programme-and-abstracts-2/
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
Software systems are becoming ever more intelligent and more useful, but the way we interact with these machines too often reveals that they don’t actually understand people. Knowledge Representation and Semantic Web focus on the scientific challenges involved in providing human knowledge in machine-readable form. However, we observe that various types of human knowledge cannot yet be captured by machines, especially when dealing with wide ranges of real-world tasks and contexts. The key scientific challenge is to provide an approach to capturing human knowledge in a way that is scalable and adequate to real-world needs. Human Computation has begun to scientifically study how human intelligence at scale can be used to methodologically improve machine-based knowledge and data management. My research is focusing on understanding human computation for improving how machine-based systems can acquire, capture and harness human knowledge and thus become even more intelligent. In this talk I will show how the CrowdTruth framework (http://crowdtruth.org) facilitates data collection, processing and analytics of human computation knowledge.
Some project links:
- http://controcurator.org/
- http://crowdtruth.org/
- http://diveproject.beeldengeluid.nl/
- http://vu-amsterdam-web-media-group.github.io/linkflows/
Talk given at MuseumNext 2013 in Amsterdam.
In this talk I focus on artificial intelligence and semantic technologies as key trends and distruptive tools and try to tie these to the importance of a robust and flexible digital museum platform.
http://www.museumnext.org/schedule/
Presentation to the CURSO DE VERANO
Bilbao Arte eta Kultura UPV/EHU: museos, redes sociales y tecnología 2.0 (museums, social networks and 2.0 technology)
6-7 July 2010 at the invitation of the University of the Basque Country.
http://tubilbao.blogspot.com/2010/06/bak-uda-ikastaroa-curso-de-verano.html
Similar to Expand. Learn. Interact: Enabling Digital Humanities (20)
The Rijksmuseum Collection as Linked DataLora Aroyo
Presentation at ISWC2018: http://iswc2018.semanticweb.org/sessions/the-rijksmuseum-collection-as-linked-data/ of our paper published originally in the Semantic Web Journal: http://www.semantic-web-journal.net/content/rijksmuseum-collection-linked-data-2
Many museums are currently providing online access to their collections. The state of the art research in the last decade shows that it is beneficial for institutions to provide their datasets as Linked Data in order to achieve easy cross-referencing, interlinking and integration. In this paper, we present the Rijksmuseum linked dataset (accessible at http://datahub.io/dataset/rijksmuseum), along with collection and vocabulary statistics, as well as lessons learned from the process of converting the collection to Linked Data. The version of March 2016 contains over 350,000 objects, including detailed descriptions and high-quality images released under a public domain license.
FAIRview: Responsible Video Summarization @NYCML'18Lora Aroyo
Presentation at the NYC Media Lab (NYCML2018). There is a growing demand for news videos online, with more consumers preferring to watch the news than read or listen to it. On the publisher side, there is a growing effort to use video summarization technology in order to create easy-to-consume previews (trailers) for different types of broadcast programs. How can we measure the quality of video summaries and their potential to misinform? This workshop will inform participants about automatic video summarization algorithms and how to produce more “representative” video summaries. The research presented is from the FAIRview project and is supported by the Digital News Innovation Fund (DNI Fund), which is part of the Google News Initiative.
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
Lora Aroyo, Chiel van den Akker, Marnix van Berchum, Lodewijk
Petram, Gerard Kuys, Tommaso Caselli, Jacco van Ossenbruggen, Victor de Boer, Sabrina Sauer, Berber Hagedoorn
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Lora Aroyo
The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to the volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, this assumption often creates issues in practice. Previous experiments we performed found that inter-annotator disagreement is usually never captured, either because the number of annotators is too small to capture the full diversity of opinion, or because the crowd data is aggregated with metrics that enforce consensus, such as majority vote. These practices create artificial data that is neither general nor reflects the ambiguity inherent in the data.
To address these issues, we proposed the method for crowdsourcing ground truth by harnessing inter-annotator disagreement. We present an alternative approach for crowdsourcing ground truth data that, instead of enforcing an agreement between annotators, captures the ambiguity inherent in semantic annotation through the use of disagreement-aware metrics for aggregating crowdsourcing responses. Based on this principle, we have implemented the CrowdTruth framework for machine-human computation, that first introduced the disagreement-aware metrics and built a pipeline to process crowdsourcing data with these metrics.
In this paper, we apply the CrowdTruth methodology to collect data over a set of diverse tasks: medical relation extraction, Twitter event identification, news event extraction and sound interpretation. We prove that capturing disagreement is essential for acquiring a high-quality ground truth. We achieve this by comparing the quality of the data aggregated with CrowdTruth metrics with a majority vote, a method which enforces consensus among annotators. By applying our analysis over a set of diverse tasks we show that, even though ambiguity manifests differently depending on the task, our theory of inter-annotator disagreement as a property of ambiguity is generalizable.
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneLora Aroyo
Ambiguity in interpreting signs is not a new idea, yet the vast majority of research in machine interpretation of signals such as speech, language, images, video, audio, etc., tend to ignore ambiguity. This is evidenced by the fact that metrics for quality of machine understanding rely on a ground truth, in which each instance (a sentence, a photo, a sound clip, etc) is assigned a discrete label, or set of labels, and the machine’s prediction for that instance is compared to the label to determine if it is correct. This determination yields the familiar precision, recall, accuracy, and f-measure metrics, but clearly presupposes that this determination can be made. CrowdTruth is a form of collective intelligence based on a vector representation that accommodates diverse interpretation perspectives and encourages human annotators to disagree with each other, in order to expose latent elements such as ambiguity and worker quality. In other words, CrowdTruth assumes that when annotators disagree on how to label an example, it is because the example is ambiguous, the worker isn’t doing the right thing, or the task itself is not clear. In previous work on CrowdTruth, the focus was on how the disagreement signals from low quality workers and from unclear tasks can be isolated. Recently, we observed that disagreement can also signal ambiguity. The basic hypothesis is that, if workers disagree on the correct label for an example, then it will be more difficult for a machine to classify that example. The elaborate data analysis to determine if the source of the disagreement is ambiguity supports our intuition that low clarity signals ambiguity, while high clarity sentences quite obviously express one or more of the target relations. In this talk I will share the experiences and lessons learned on the path to understanding diversity in human interpretation and the ways to capture it as ground truth to enable machines to deal with such diversity.
Stitch by Stitch: Annotating Fashion at the RijksmuseumLora Aroyo
https://www.rijksmuseum.nl/en/stitch-by-stitch
http://annotate.accurator.nl/
Fashion can be found everywhere in museums. Fashion heritage collected over centuries: costumes, accessories, paintings, prints and photographs. But while some clothes and accessories are easily found and identified, others are obscure and require a trained eye to describe. What are we looking at? What kind of sleeve is this? Which materials and techniques have been used? More specific descriptions of the images facilitate better use of digital collections and enable users to wander through them in detail.
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...Lora Aroyo
http://mw2016.museumsandtheweb.com/proposal/accurator-enriching-collections-with-expert-knowledge-from-the-crowd/
Crowdsourcing is not a new phenomenon for museums. There are good examples for museums (e.g., Powerhouse museum, steve.museum). But not all crowdsourcing initiatives are successful. Crowdsourced tagging does not always contribute to a better understanding of art and can even be confusing.
The Rijksmuseum and the VU University Amsterdam developed the Accurator: a visual tool to get experts in domains like birds, bibles, ships, castles, etc. involved in annotating art and enrich the museums’ metadata with expertise that is not available internally.
In this how-to session, we demonstrate the tool and the ways other museums can implement this Open Web application for their own collections.
7. Mul9tude
of
collec9ons
Mul9-‐
dimensional
Mul9-‐
perspec9ve
“Jungle
of
scales”
Mul9-‐granularity
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
8. Digital
Heritage
Mission
“We
enable
everyone
to
u/lize
the
collec/ons
Lora Aroyo
to
learn,
experience
&
create.”
(mission
Sound
and
Vision)
“a
place
where
a
na/on
nourishes
its
memory,
and
exerts
its
imagina/on
-‐
where
it
connects
with
its
past
and
invents
its
future”
(Pierre
Ryckmans,
1996)
„places
that
foster
curiosity,
celebrate
ideas
and
enable
discovery.”
(sxswLAM
community,
hHelp
people
Mp://www.sxswlam.com/)
finding
their
way
9. Informa(on:
Museums
&
Archives
as
Inventories
of
the
World
André
Malraux,
The
Imaginary
Museum
of
World
Sculpture,
1953
11. effec/ve
&
efficient
in
delivering
change
and
tangible
benefits
strategic
advantage
through
the
innova/on
inherent
in
this
digital
ac/vity
economic
benefit
to
community
that
demonstrate
the
value
of
endeavours
in
monetary
terms
(
the
community
has
been
changed
by
the
resource
in
beneficial
ways
that
can
be
clearly
iden/fied
Source:
Simon
Tanner,
Kings
College
London
,
hBp://www.kdcs.kcl.ac.uk/innovaGon/impact.html
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
12. New
Era
of
Compu9ng
• amount
Lora Aroyo
of
data
&
scale
of
computa9on
available
have
increased
by
a
previously
inconceivable
amount
• CS
&
AI
moved
out
of
thought
problems
–
entered
empirical
science
13.
14. Open Domain Question-Answering Machine, that given
– Rich Natural Language Questions
– Over a Broad Domain of Knowledge
Won a 2-game Jeopardy match against the all-time winners
– viewed by over 50,000,000
IBM
Confiden/al
15. Enabling
(Digital)
Humani9es
Expand.
Learn.
Interact.
How
to
Do
It?
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
16. 1.
Expand
“EXPAND
human
cogni/on,
makes
the
jobs
we
do
easier,
like
a
cogni9ve
Lora Aroyo
prosthesis,
especially
when
dealing
with
processing
massive
data,
or
data
that
requires
human
interpreta9on”
• Support
for
Interpreta9on
• Structured
Seman9cs
17. … Digital Humanities researchers
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
18. … often don’t find
what they were searching for
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
19. HOW
DO
WE
FILL
THE
GAP?
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
21. • a
Digital
Hermeneu9cs
theory
of
interpretaGon
of
informaGon
• bringing
people
and
technology
together
to
explore:
– how
to
model
and
represent
informa/on
– how
to
provide
engaging
interac/on
– how
to
support
interpreta/on
“Digital
HermeneuGcs:
Agora
and
the
online
understanding
of
cultural
heritage”
In
proceedings
of
Web
Science
Conference,
(ACM:
New
York,
2011)
22. Simple
Event
Model
(SEM)
OpenAnnota/on
(OA)
&
SKOS
DIVE:MEDIA
OBJECT
SEM:EVEN
T
SEM:PLACE
SEM:TIME
SKOS:CONCEP
T
SEM:ACTOR
OA:ANNOTATIO
N
• Links
to
EUROPEANA
(mul/lingual)
• Links
to
DBPedia,
other
vocabularies
23. Erp,
M.
van;
Oomen,
J.;
Segers,
R.;
Akker,
C.
van
de;
Aroyo,
L.;
Jacobs,
G.;
Legêne,
S;
Meij,
L.
van
der;O
ssenbruggen,
J.R.
van;
Schreiber,
G.
Automa/c
Heritage
Metadata
Enrichment
with
Historic
Events
Museums
and
the
Web
2011
hMp://www.museumsandtheweb.com/mw2011/
papers/automa/c_heritage_metadata_enrichment_with_hi
27. 2.
Learn
“LEARN
as
you
use
it
–
most
machine
errors
are
easy
for
a
human
to
detect,
and
we
can
instrument
usage
of
systems
to
beMer
understand
the
system
and
the
problem
it
solves”
• Teach
Lora Aroyo
machines
• Deal
with
diversity
of
interpreta9on
29. Diversity,
Opinions,
Interpreta9ons
Human Disagreement IS Essential in Helping
Machines with Semantic Interpretation!
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
30. Crowdsourcing
Diversity
of
Interpreta9on
“best
collec9ve
decisions
are
result
of
disagreement,
not
consensus
or
compromise”
James
Surowiecki
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
31. CrowdTruth.org
A
novel
approach
to
gather
diversity
of
perspec9ves
&
opinions
from
the
crowd,
expand
expert
vocabularies
with
these
and
gather
new
type
of
gold
standard
for
machines
L.
Aroyo,
C.
Welty:
Crowd
Truth:
Harnessing
disagreement
in
crowdsourcing
a
rela/on
extrac/on
gold
standard.
ACM
WebSci
2013.
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
32. Crowdsourcing
for
Video
Analysis
CrowdTruth.org
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
33. Following the grandeur of Baroque, Rococo art is
often dismissed as frivolous and unserious, but
Waldemar Januszczak disagrees. […] The first
episode is about travel in the 18th century and how
it impacted greatly on some of the finest art ever
made. The world was getting smaller and took on
new influences shown in the glorious Bavarian
pilgrimage architecture, Canaletto's romantic
Venice and the blossoming of exotic designs and
tastes all over Europe.
Rococo:
Travel,
pleasure,
madness
A
boarding
school
where
boys
from
the
Dutch
East
Indies
receive
a
voca/onal
educa/on
has
been
set
up
by
the
ministry
of
Social
Affairs
near
Batavia.
Trumpeter
sounds
the
reveille;
-‐
the
boys
get
out
of
bed;
-‐
the
muster
is
held
in
front
of
the
building;
akerwards
the
boys
stand
in
line
with
a
food
bowl;
they
get
rice
pudding
and
eat
this
around
long
tables
in
the
open
air;
-‐
the
boys
get
instruc/ons
and
prac/ce
a.o.
drawing
shapes,
forging,
metalworking,
filing
and
woodworking;
News
from
Indonesia:
youth
care
hdp://vista-‐tv.eu
hdp://dive.beeldengeluid.nl
35. Crowdsourcing
for
Text
Analysis
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
CrowdTruth.org
36. • One
truth:
data
collec/on
efforts
assume
one
correct
interpreta/on
for
every
example
• All
examples
are
created
equal:
ground
truth
treats
all
examples
the
same
–
either
match
the
correct
result
or
not
• Detailed
guidelines
help:
if
examples
cause
disagreement
-‐
add
instruc/ons
to
limit
interpreta/ons
• Disagreement
is
bad:
increase
quality
of
annota/on
data
by
reducing
disagreement
among
the
annotators
• One
is
enough:
most
of
the
annotated
examples
are
evaluated
by
one
person
• Experts
are
beder:
annotators
with
domain
knowledge
provide
beMer
annota/ons
• Once
done,
forever
valid:
annota/ons
are
not
updated;
new
data
not
aligned
with
previous
7
Myths
of
Human
Annota9on
myths
directly
influence
the
prac/ce
of
collec/ng
human
annotated
data;
Need
to
be
revisited
in
the
context
of
new
changing
world
&
in
the
face
of
a
new
theory
of
truth
(CrowdTruth)
Lora Aroyo
37. 3.
Interact
“INTERACT
naturally.
We
need
to
bring
machines
closer
to
their
users,
we
have
adapted
ourselves
enough
to
them,
they
should
understand
natural
Lora Aroyo
language,
spoken
or
wriMen,
be
able
to
process
images
and
videos.
These
simple
human
problems
are
extremely
complex
for
machines,
but
are
hallmarks
of
a
new
compu9ng
era.”
• Gathering
Human
Seman9cs
• Engaging
Users
41. Engage
with
Games
in
Sciences
hdp://game.crowdtruth.org/
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
42. Engage
with
Games
hdp://waisda.nl/
in
Entertainment
hdp://spotvogel.vroegevogels.vara.nl/
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo
43. Engage
with
Novel
Interfaces
Event-‐based
Browsing
Of
Linked
Historical
Media
53. CrowdTruth
Team:
Lora
Aroyo,
Chris
Welty
Robert-‐Jan
Sips,
Carlos
Mar9nez
Or9z,
Anca
Dumitrache,
Oana
Inel,
Benjamin
Timmermans,
Susanna
van
de
Ven,
Merel
van
Empel,
Jelle
v.d.
Ploeg,
Ta/ana
Cristea,
Khalid
Khamkham,
HarriëMe
Smook,
Rens
van
Honschooten,
Arne
Rutjes
CrowdTruth.org
github.com/CrowdTruth
54. Links
On
the
Web
• http://waida.nl
• http://prestoprime.org
• http://agora.cs.vu.nl
• http://sealincmedia.wordpress.com
• http://notube.tv
• http://vista-tv.eu
• http://dive.beeldengeluid.nl
• http://crowdtruth.org
• http://game.crowdtruth.org
• http://wm.cs.vu.nl
On
Twider
@waisda
@agora-‐project
@sealincmedia
@prestocenter
@notubeproject
@vistatv
#CrowdTruth
http://lora-aroyo.org ! http://slideshare.net/laroyo ! @laroyo