WebSci2013 Harnessing Disagreement in Crowdsourcing

•

2 likes•5,872 views

The document discusses harnessing disagreement in crowdsourcing for cognitive computing tasks like relation extraction. Typically, a single gold standard answer is assumed, but the authors argue that annotator disagreement is not just noise but a source of useful information. By capturing and understanding disagreement through frequencies and similarities, machine learning models can be scored based on how well their outputs fit within the space of possible human interpretations. This approach aims to better adapt models to new annotation tasks by tolerating the inherent vagueness and ambiguity of human understanding.

Technology Entertainment & Humor

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
gathering gold standard annotations for relation extraction

Crowd Truth
Harnessing Disagreement in
Crowdsourcing

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Gold Standard
Assumption
• typically in cognitive systems
• for each annotated instance there is a single right answer
• gold standard quality can be measured in inter-annotator
agreement
Let them disagree?

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Hypothesis
Annotator disagreement is not noise, but signal.
Not a problem to overcome but a source of information for machines
Artificially restricting humans does not help machines to learn.
They will learn better from diversity

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Position
disagreement is a sign of
intrinsic vagueness & ambiguity in human understanding

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Approach Principles
1.  Tolerate, capture & exploit disagreement
2.  Understand it by a space of possibilities (frequencies & similarities)
3.  Score the machine output based on where it falls in this space
4.  Adapt to new annotation tasks

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Relation Extraction
crowdsourcing gold standard data
Relations overlap in meaning
Sentences are vague and ambiguous
Experts have different interpretations

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Feeling the way the CHEST expands (PALPATION), can identify areas of
the lung that are full of ﬂuid.
?PALPATIONIs CHEST related to
diagnose location associated
with
is_a otherpart_of
0 0 02 3 0 0 0 1 0 0 44 1
?CONJUNCTIVITISHYPERAEMIA related toIs
0 0 0 1 0 0 0 013 0 0 0 0 0
symptomcause
Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora)
of the eyes are symptoms common to all forms of CONJUNCTIVITIS.

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Harnessing Disagreement
• Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on
each sentence as the cosine of the unit vector for relation with sentence vector
• Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the
same relation for a sentence, the max score is 1, indicating a clear sentence
• Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as
well. Indicates how confusable the linguistic expression of two relations are
• Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score
• Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high
clarity score, it means that it is at least possible to express the relation clearly

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
The Dark Side of Crowdsourcing
Disagreement
• spammers generate disagreement for the wrong reasons
• most spam detection requires gold standard
• Worker-sentence disagreement: the average of all the cosines between each
worker’s sentence vector and the full sentence vector (minus that worker).
Indicates how much a worker disagrees with the crowd on a sentence basis
• Worker-worker disagreement: a pairwise confusion matrix between workers
and the average agreement across the matrix for each worker. Indicates
whether there are consistently like-minded workers

Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo
Questions?

The document discusses measuring quality in crowdsourced semantic interpretation tasks where there is disagreement between annotators. It introduces the concept of the "three sides of CrowdTruth" - representation, workers, and sentences. It shows that sentence quality and relation quality impact measurements of worker quality, and that considering these interdependencies can significantly improve worker metric accuracy in detecting spam annotators. Filtering both low-quality sentences and vague relations best separates high- and low-quality workers in evaluations.

Truth is a Lie: 7 Myths about Human Annotation @CogComputing Forum 2014

Lora Aroyo

Big data is having a disruptive impact across the sciences. Human annotation of semantic interpretation tasks is a critical part of big data semantics, but it is based on an antiquated ideal of a single correct truth that needs to be similarly disrupted.We expose seven myths about human annotation, most of which derive from that antiquated ideal of truth, and dispell these myths with examples from our research.We propose a new theory of truth, Crowd Truth, that is based on the intuition that human interpretation is subjective, and that measuring annotations on the same objects of interpretation (in our examples, sentences) across a crowd will provide a useful representation of their subjectivity and the range of reasonable interpretations.

Crowdsourcing & Semantic Web: Dagstuhl 2014 (Presentation Lora)

Lora Aroyo

Exploiting disagreement through open ended tasks for capturing interpretation...

Benjamin Timmermans

The document discusses using open-ended crowdsourcing tasks to capture interpretation spaces for multimedia data. It outlines challenges with existing approaches that assume a single ground truth and stimulate agreement. The methodology proposes using open-ended tasks without predefined answers to gather a range of interpretations. Preliminary results show capturing diverse sound interpretations through open-ended tasks to tag sounds. The conclusion is that open-ended tasks do not force agreement but can capture the full interpretation space for more complete multimedia data.

CCCT University of Amsterdam Seminars 2013: Crowdsourcing Session

Lora Aroyo

The document discusses harnessing disagreement in crowdsourcing to create gold standards for training cognitive systems. It describes representing disagreement through vectors for sentences, workers, and relations to understand different interpretations. Scoring systems can consider where their outputs fall within the space of possible interpretations represented by disagreement vectors. The goal is to capture and exploit disagreement rather than treating it as noise.

CrowdTruth: Machine-Human Computation for Harnessing Disagreement in Semantic...

Lora Aroyo

CrowdTruth is a machine-human computation platform that harnesses disagreement in semantic interpretation to help machines learn. It presents tasks to human workers to annotate examples for interpretation and analyzes levels of disagreement, which can indicate ambiguity, low quality work, or the clarity of examples. The open source CrowdTruth software includes components for machine preprocessing, reusable microtasks, and analytics on disagreement and provenance tracking to collect and evaluate ground truth data.

Good News is No News? Effects of Positive Stories about African Americans on ...

Miglena Sternadori

#CrowdTruth: Linked Data for Information Extraction @ISWC2015

Lora Aroyo

The document discusses how the social web and TV viewing are converging, with people using second screens like phones and tablets to discuss or comment on TV programs via social media. It describes the NoTube project, which aims to personalize TV interaction by using social and semantic web data to provide personalized recommendations. NoTube aggregates viewing data and profiles user interests to surface new, relevant programs while balancing predictability with serendipity. Key challenges include dealing with sparse, fragmented TV preference data on the open web.

Agora User Committee Meeting 2013

Lora Aroyo

The Agora project is a collaboration between the History and Computer Science departments at the VU University Amsterdam, the Rijksmuseum Amsterdam and the Dutch national audiovisual archive Beeld en Geluid. The aim of Agora is to develop a social platform in which museum objects can be placed into an explicit (art)historic context. Through the (art)historic context, objects from highly diverse museum collections can be related, resulting in a more complete and illustrated description of historical events. End-users will also be allowed to create their own personal narratives which will lead to theoretical reflection on the meaning of digitally mediated public history in contemporary society. Check out our website http://agora.cs.vu.nl/ and our twitter feed @agora_project

SealincMedia Accurator Demos

Lora Aroyo

Finding relevant multimedia content is notoriously difficult, and the difficulty increases with the size and heterogeneity of the content collection. Linked cultural media collections are heterogeneous by nature and rapidly increase in size, mainly through enormous amounts of user-generated content and metadata that are placed on the Internet on a daily basis. Without mechanisms for keeping any part of these collections easily accessible by any user at any time and any use context, the value of these collections for the community will drop, just like their value as an economic asset. demo: http://2-dot-rma-accurator.appspot.com/#Intro website: http://sealincmedia.wordpress.com/

AGORA Project: Final Review 2012

Lora Aroyo

The document provides an overview of the AGORA project, which aims to create a social platform where museum objects are placed in historical context using events and user-generated narratives. It discusses the team members, goals, key results including publications and demos produced. It also describes work on developing an event model and extracting events from text, as well as pilots conducted with university history students to test the AGORA demos.

CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...

Lora Aroyo

This document summarizes a master's thesis on developing a personalized mobile museum tour with real-time adaptation. The thesis aimed to improve on an existing offline personalized tour system by enabling real-time user positioning and tour adaptation based on time constraints, artwork preferences, and spatial information within the museum. It investigated using WiFi radio frequency fingerprinting for real-time localization, which achieved accuracy within 1.25 meters. The mobile tour system was designed to adapt the recommended artworks and tour pathing in real-time based on the user's profile and detected location within the museum.

Europeana Tech 2011

Michiel Hildebrand

The document discusses collecting and managing user-generated metadata for video content annotation. It describes how annotating videos is currently a time-consuming process requiring 5 times the duration of the video. It also discusses using crowdsourcing to generate coarse-grained annotations in a user vocabulary to better support finding video fragments. The document also examines linking user-generated annotations to concepts in the web of data.

Stitch by Stitch: Annotating Fashion at the Rijksmuseum

Lora Aroyo

https://www.rijksmuseum.nl/en/stitch-by-stitch http://annotate.accurator.nl/ Fashion can be found everywhere in museums. Fashion heritage collected over centuries: costumes, accessories, paintings, prints and photographs. But while some clothes and accessories are easily found and identified, others are obscure and require a trained eye to describe. What are we looking at? What kind of sleeve is this? Which materials and techniques have been used? More specific descriptions of the images facilitate better use of digital collections and enable users to wander through them in detail.

DIVE+: Explorative Search for Digital Humanities

Johan Oomen

DIVE+ is an event-centric linked data digital collection browser aimed to provide an integrated and interactive access to multimedia objects from various heterogeneous online collections. It enriches the structured metadata of online collections with linked open data vocabularies with focus on events, people, locations and concepts that are depicted or associated with particular collection objects. DIVE+ is result of a true inter-disciplinary collaboration between computer scientists, humanities scholars, cultural heritage professionals and interaction designers. The tool allows humanities scholars to explore unexpected relations between entities and media objects and to construct and share navigation paths to develop research narratives.

Dartmouth 2018 writing assessment presentation Les Perelman

Les Perelman

This document discusses issues related to writing assessment and its rhetoric. It covers key concepts in assessment such as validity, reliability, significance, and consequences. It also discusses types of validity like face validity, construct validity, and predictive validity. Additionally, it examines types of reliability including test-retest reliability and parallel forms reliability. The document cautions that assessments should avoid unintended consequences and biases that could adversely impact certain groups. It stresses the importance of clearly defining the purpose and goals of any assessment.

L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn

RwanEnan

This chapter introduces vector semantics for representing word meaning in natural language processing applications. Vector semantics learns word embeddings from text distributions that capture how words are used. Words are represented as vectors in a multidimensional semantic space derived from neighboring words in text. Models like word2vec use neural networks to generate dense, real-valued vectors for words from large corpora without supervision. Word vectors can be evaluated intrinsically by comparing similarity scores to human ratings for word pairs in context and without context.

School Essay Essays Format. Online assignment writing service.

Carolina Abrams

The document provides instructions for submitting an assignment request to the website HelpWriting.net. It outlines a 5-step process: 1) Create an account with an email and password. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions to ensure satisfaction, with a full refund option for plagiarized work.

Communities of Trust - from regulation to cooperation

Screamin Wrba

This document discusses the concept of "Communities of Trust" and how they can be built through cooperation rather than regulation or standardization. It outlines three levels that structure work - contracts, processes/tools, and individuals/interactions. While regulation and standardization aim to increase security and confidence, cooperation is needed to build trust at the team level. The document draws parallels between cooperation in human and animal societies, discussing concepts like reciprocal altruism. It also summarizes findings from a Google study that identified psychological safety - where all members speak proportionately and with social sensitivity - as key to building effective teams.

the relevance theory- pragmatics

kiran nazir

The document summarizes the key ideas of relevance theory, proposed by Dan Sperber and Deirdre Wilson. It argues that communication relies on implicit inferences rather than encoding of messages. There are two methods of communication - coded, where messages are encoded and decoded, and ostensive-inferential, where the communicator provides just enough information relying on the audience to infer the intended meaning based on context. Relevance theory explains ostensive-inferential communication, where new information is relevant if it has contextual implications, strengthens existing assumptions, or contradicts assumptions. Every act of communication implicitly presumes to be optimally relevant to the audience.

kiranppt-170704170919 (1).pdf

SemaYILDIZHUSEYNOV1

The document summarizes the key ideas of relevance theory proposed by Dan Sperber and Deirdre Wilson. It argues that communication relies on implicit inferences rather than encoding and decoding of messages. Relevance theory states that hearers will process communications until they find meaning that satisfies their expectation of relevance, then will stop processing. It describes two methods of communication - coded, where messages are encoded and decoded, and ostensive-inferential, where the speaker conveys just enough information relying on the hearer to infer implicit meanings from the context. The theory is influenced by Grice's cooperative principle and maxims of conversation. Relevance is defined based on processing effort required and contextual effects or implications.

RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020

Zachary Schendel

The document discusses Netflix's research into how people perceive similarity in recommended content. It found that perceptions are influenced by three factors: 1) the person seeing the recommendations and their past experiences, 2) the current context, and 3) where the recommendations are placed. By accounting for these factors, Netflix was able to create a new similarity model that resulted in fewer perceived unreliable recommendations from users.

Sample Self Evaluation Essay.pdf

Andrea Santiago

The passage discusses the challenges of writing a self-evaluation essay, noting that it requires balancing introspection with objective analysis, and showcasing accomplishments while acknowledging areas for improvement. It also must maintain a coherent narrative tone and consider the audience. Writing a self-evaluation essay is a nuanced task that demands introspection, self-awareness, and strong writing skills. The author must balance humility and confidence while crafting a cohesive narrative tailored to the audience.

Cbse Class 7 English Essay

Vanessa Henderson

The document provides instructions for creating an account and submitting a request for an assignment writing service on the website HelpWriting.net. Users must first register with a password and email, then complete a form with assignment details and deadline. Writers will bid on the request and the user can choose a writer based on qualifications. After receiving the paper, the user can request revisions if needed.

Essay On Exam Stress. Online assignment writing service.

Amanda Anderson

The Great Influenza by John Barry depicts the deadly 1918 influenza pandemic that killed over 100 million people worldwide in just 24 months. Barry illustrates how quickly the virus spread across the globe as people carried it from country to country. Despite medical researchers' efforts to understand and fight the epidemic, the influenza strain overwhelmed communities and healthcare systems. The book provides a detailed account of the origins, spread, and impact of the 1918 pandemic, the worst disease outbreak in history.

Example Of Event Report Essay

Emily Owusuansah

Recsys Presentation

Neal Lathia

This document proposes a method for private distributed collaborative filtering using estimated concordance measures. It defines concordance as a measure of agreement between users' ratings that can be used to estimate similarity in a privacy-preserving way. The method estimates upper and lower bounds on concordance between users to calculate similarity without revealing private rating data. An evaluation shows this approach can accurately estimate similarity coefficients and generate recommendations, especially for larger and denser datasets. Future work is needed to further analyze concordance-based similarity and its effects on trust in distributed recommender systems.

Xmas Writing Paper

Jennifer Perry

The document discusses two instances of infanticide in the novel The Woman Warrior by Maxine Hong Kingston. The first instance is in the chapter "No Name Woman" where an aunt drowns her baby girl to hide the shame of having a child out of wedlock. The second instance is when Brave Orchid recalls a baby born without an anus being left outside to die as there was no hope of it surviving. Both instances are used to provoke thought about what is considered right or wrong culturally and to demonstrate the will to survive even in doomed circumstances.

CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity

Anca Dumitrache

The document discusses modeling crowd truth by harnessing annotator disagreement as a signal rather than noise. It proposes measuring various metrics like sentence quality, relation quality, and worker quality to capture ambiguity and understand semantic interpretation. Capturing disagreement through these metrics can provide better training data for tasks like relation extraction compared to relying on a single expert gold standard. At least 10 workers per sentence are needed to obtain the highest quality annotations.

Viewers also liked

Keynote at SMAP2012: Personalized Access to TV Content

Lora Aroyo

Agora User Committee Meeting 2013

Lora Aroyo

SealincMedia Accurator Demos

Lora Aroyo

AGORA Project: Final Review 2012

Lora Aroyo

CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...

Lora Aroyo

Europeana Tech 2011

Michiel Hildebrand

Stitch by Stitch: Annotating Fashion at the Rijksmuseum

Lora Aroyo

DIVE+: Explorative Search for Digital Humanities

Johan Oomen

Viewers also liked (8)

Keynote at SMAP2012: Personalized Access to TV Content

Agora User Committee Meeting 2013

SealincMedia Accurator Demos

AGORA Project: Final Review 2012

CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...

Europeana Tech 2011

Stitch by Stitch: Annotating Fashion at the Rijksmuseum

DIVE+: Explorative Search for Digital Humanities

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing

Dartmouth 2018 writing assessment presentation Les Perelman

Les Perelman

L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn

RwanEnan

School Essay Essays Format. Online assignment writing service.

Carolina Abrams

Communities of Trust - from regulation to cooperation

Screamin Wrba

the relevance theory- pragmatics

kiran nazir

kiranppt-170704170919 (1).pdf

SemaYILDIZHUSEYNOV1

RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020

Zachary Schendel

Sample Self Evaluation Essay.pdf

Andrea Santiago

Cbse Class 7 English Essay

Vanessa Henderson

Essay On Exam Stress. Online assignment writing service.

Amanda Anderson

Example Of Event Report Essay

Emily Owusuansah

Recsys Presentation

Neal Lathia

Xmas Writing Paper

Jennifer Perry

CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity

Anca Dumitrache

Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16

Kimberly Gomez

This document provides instructions for requesting writing assistance from HelpWriting.net. It outlines a 5-step process: 1. Create an account by providing a password and email. 2. Complete a 10-minute order form with instructions, sources, deadline, and attaching a sample for style imitation. 3. Review bids from writers and choose one based on qualifications, history, and feedback. Place a deposit to start work. 4. Ensure the paper meets expectations and authorize final payment if pleased. Free revisions are provided. 5. Multiple revisions can be requested to ensure satisfaction. Plagiarized work results in a full refund. HelpWriting.net aims to fully meet customer needs.

Semantic Patterns for Sentiment Analysis of Twitter

Knowledge Media Institute - The Open University

This document summarizes a research paper on using semantic patterns for sentiment analysis of tweets. It proposes extracting patterns from the contextual semantics and sentiment of words in tweets. These semantic sentiment patterns (SS-Patterns) are then used as features for sentiment classification, achieving better performance than syntactic or semantic features. Evaluation on tweet and entity-level sentiment analysis tasks shows the SS-Patterns approach consistently outperforms baselines. Analysis finds the extracted patterns exhibit high within-pattern sentiment consistency.

Dialogue based Meaning Negotiation

Terry Payne

Recent advances in technology have caused a proliferation of data and knowledge sources on a global scale. The ability to access and integrate these knowledge sources is crucial for critical decision making, and to facilitate this, knowledge-based intelligent applications (agents) need to resolve the differences between their knowledge models (ontologies). We present preliminary work that allows two agents to jointly determine a single correspondence between two concepts in their respective ontologies, without the need for prior joint knowledge. The agents engage in a dialogue that permits the participants to exchange information about the concepts to support the assertion or rejection of a correspondence. This paper was presented at the 15th Workshop on Computational Models of Natural Argument, 2015. More details can be found at http://www.csc.liv.ac.uk/~trp/Knowledge-Based-Agents.html

Puppy Writing Stationary Writing, Puppies, Words

Michelle Adams

The document provides instructions for creating an account and requesting writing assistance on the HelpWriting.net site. It involves a 5-step process: 1) Create an account with a password and email; 2) Complete a form with assignment details and deadline; 3) Review bids from writers and choose one; 4) Receive the paper and authorize payment if satisfied; 5) Request revisions until satisfied. The purpose is to outline the simple process for students to get help writing assignments through the website.

IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.

Heather Wilkins

The document provides instructions for registering and using an online writing assistance service. It outlines a 5-step process: 1) Create an account with a password and email. 2) Complete a form with assignment details and attach samples. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions until satisfied, with a refund option for plagiarized work. The service utilizes a bidding system and promises original, high-quality content.

2000 Word Essay How Long Introduction. Online assignment writing service.

Tammy Adams

This document provides instructions for requesting an assignment writing service from HelpWriting.net in 5 steps: 1. Create an account with a password and email. 2. Complete a 10-minute order form with instructions, sources, and deadline. 3. Choose a writer based on their bid, qualifications, history, and feedback. 4. Review the completed paper and authorize payment if satisfied. 5. Request revisions to ensure satisfaction, and the company guarantees original, high-quality content or a full refund.

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing (20)

Dartmouth 2018 writing assessment presentation Les Perelman

L6.pptxsdv dfbdfjftj hgjythgfvfhjyggunghb fghtffn

School Essay Essays Format. Online assignment writing service.

Communities of Trust - from regulation to cooperation

the relevance theory- pragmatics

kiranppt-170704170919 (1).pdf

RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020

Sample Self Evaluation Essay.pdf

Cbse Class 7 English Essay

Essay On Exam Stress. Online assignment writing service.

Example Of Event Report Essay

Recsys Presentation

Xmas Writing Paper

CrowdTruth Tutorial: Using the Crowd to Understand Ambiguity

Size Of Writing Paper. Writing Paper Sizes Chart. 2019-01-16

Semantic Patterns for Sentiment Analysis of Twitter

Dialogue based Meaning Negotiation

Puppy Writing Stationary Writing, Puppies, Words

IndiaS Natural Beauty Essay In Hindi. Online assignment writing service.

2000 Word Essay How Long Introduction. Online assignment writing service.

More from Lora Aroyo

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf

Lora Aroyo

CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning

Lora Aroyo

The document introduces CATS4ML, a crowdsourcing challenge to discover blindspots in machine learning models by having participants label images in the Open Images Dataset that are incorrectly labeled by AI. The goal is to crowdsource adverse test sets that can capture biases and improve evaluation of AI. The challenge runs through April 2021 and invites individuals and teams to discover interesting mislabeled images and contribute them for review and inclusion in the test sets. Winning contributions will be promoted at the next CrowdCamp conference.

Harnessing Human Semantics at Scale (updated)

Lora Aroyo

The document appears to be a series of tweets and posts by Lora Aroyo discussing data science and crowdsourcing techniques. Some key points discussed include harnessing human semantics at scale through crowdsourcing and nichesourcing, measuring quality and reproducibility of crowdsourced results, and experimenting with different task designs and payment models to assess their impact. Specific examples mentioned include using crowdsourcing to add detailed annotations to museum collections and to find "blindspots" in AI models through a data challenge.

Data excellence: Better data for better AI

Lora Aroyo

The document discusses the importance of data quality and a data lifecycle approach for artificial intelligence. Some key points made include: - A data lifecycle is needed to guide best practices for data research and development, similar to how a software lifecycle guides software engineering. - Data quality must be addressed through practices and standards to help avoid unintended AI behaviors that can result from low quality data. - Disagreement in annotation tasks can provide valuable signals about ambiguity and diversity rather than just being considered noise. - Achieving high quality, reliable data requires consideration of aspects like validity, fidelity, reproducibility and maintaining data over time - an approach toward "data excellence".

CHIP Demonstrator presentation @ CATCH Symposium

Lora Aroyo

This document summarizes the CHIP project, which aims to use semantic metadata about cultural heritage objects to improve personalized access and recommendations for museum visitors. The CHIP approach involves making metadata and vocabularies available as RDF/OWL, aligning and enriching the data, and using it to build a combined user model for generating virtual and physical museum tours. Experiments show semantic relations can enhance content-based recommendations for novices and experts. Follow-up projects include Agora, deploying the techniques at the Rijksmuseum in Amsterdam.

Semantic Web Challenge: CHIP Demonstrator

Lora Aroyo

The Rijksmuseum Collection as Linked Data

Lora Aroyo

Presentation at ISWC2018: http://iswc2018.semanticweb.org/sessions/the-rijksmuseum-collection-as-linked-data/ of our paper published originally in the Semantic Web Journal: http://www.semantic-web-journal.net/content/rijksmuseum-collection-linked-data-2 Many museums are currently providing online access to their collections. The state of the art research in the last decade shows that it is beneficial for institutions to provide their datasets as Linked Data in order to achieve easy cross-referencing, interlinking and integration. In this paper, we present the Rijksmuseum linked dataset (accessible at http://datahub.io/dataset/rijksmuseum), along with collection and vocabulary statistics, as well as lessons learned from the process of converting the collection to Linked Data. The version of March 2016 contains over 350,000 objects, including detailed descriptions and high-quality images released under a public domain license.

Keynote at International Conference of Art Libraries 2018 @Rijksmuseum

Lora Aroyo

FAIRview: Responsible Video Summarization @NYCML'18

Lora Aroyo

Presentation at the NYC Media Lab (NYCML2018). There is a growing demand for news videos online, with more consumers preferring to watch the news than read or listen to it. On the publisher side, there is a growing effort to use video summarization technology in order to create easy-to-consume previews (trailers) for different types of broadcast programs. How can we measure the quality of video summaries and their potential to misinform? This workshop will inform participants about automatic video summarization algorithms and how to produce more “representative” video summaries. The research presented is from the FAIRview project and is supported by the Digital News Innovation Fund (DNI Fund), which is part of the Google News Initiative.

Understanding bias in video news & news filtering algorithms

Lora Aroyo

StorySourcing: Telling Stories with Humans & Machines

Lora Aroyo

This document discusses Lora Aroyo's work on using events and narratives to enhance access to cultural heritage collections. It describes early projects that linked cultural objects to events and entities to provide more context and engagement for online users. This led to work modeling historical events and extracting event properties and relationships to generate "proto-narratives". Later projects like DIVE and DIVE+ developed event-centric exploratory search tools and media suites. More recent efforts focus on crowdsourcing event tagging and curating to further engage audiences and remix archival stories. A key challenge discussed is the lack of standardized event vocabularies across cultural heritage communities.

Data Science with Humans in the Loop

Lora Aroyo

Digital Humanities Benelux 2017: Keynote Lora Aroyo

Lora Aroyo

This document discusses harnessing human semantics at scale through crowdsourcing and nichesourcing. It addresses making crowdsourcing efforts measurable, reproducible, engaging and sustainable. Some key points discussed are identifying crowdsourcing goals, assessing the impact of task and result designs, measuring quality and progress over time, and running continuous campaigns to reproduce and sustain results at scale.

DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...

Lora Aroyo

Crowdsourcing ambiguity aware ground truth - collective intelligence 2017

Lora Aroyo

The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to the volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, this assumption often creates issues in practice. Previous experiments we performed found that inter-annotator disagreement is usually never captured, either because the number of annotators is too small to capture the full diversity of opinion, or because the crowd data is aggregated with metrics that enforce consensus, such as majority vote. These practices create artificial data that is neither general nor reflects the ambiguity inherent in the data. To address these issues, we proposed the method for crowdsourcing ground truth by harnessing inter-annotator disagreement. We present an alternative approach for crowdsourcing ground truth data that, instead of enforcing an agreement between annotators, captures the ambiguity inherent in semantic annotation through the use of disagreement-aware metrics for aggregating crowdsourcing responses. Based on this principle, we have implemented the CrowdTruth framework for machine-human computation, that first introduced the disagreement-aware metrics and built a pipeline to process crowdsourcing data with these metrics. In this paper, we apply the CrowdTruth methodology to collect data over a set of diverse tasks: medical relation extraction, Twitter event identification, news event extraction and sound interpretation. We prove that capturing disagreement is essential for acquiring a high-quality ground truth. We achieve this by comparing the quality of the data aggregated with CrowdTruth metrics with a majority vote, a method which enforces consensus among annotators. By applying our analysis over a set of diverse tasks we show that, even though ambiguity manifests differently depending on the task, our theory of inter-annotator disagreement as a property of ambiguity is generalizable.

My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone

Lora Aroyo

Ambiguity in interpreting signs is not a new idea, yet the vast majority of research in machine interpretation of signals such as speech, language, images, video, audio, etc., tend to ignore ambiguity. This is evidenced by the fact that metrics for quality of machine understanding rely on a ground truth, in which each instance (a sentence, a photo, a sound clip, etc) is assigned a discrete label, or set of labels, and the machine’s prediction for that instance is compared to the label to determine if it is correct. This determination yields the familiar precision, recall, accuracy, and f-measure metrics, but clearly presupposes that this determination can be made. CrowdTruth is a form of collective intelligence based on a vector representation that accommodates diverse interpretation perspectives and encourages human annotators to disagree with each other, in order to expose latent elements such as ambiguity and worker quality. In other words, CrowdTruth assumes that when annotators disagree on how to label an example, it is because the example is ambiguous, the worker isn’t doing the right thing, or the task itself is not clear. In previous work on CrowdTruth, the focus was on how the disagreement signals from low quality workers and from unclear tasks can be isolated. Recently, we observed that disagreement can also signal ambiguity. The basic hypothesis is that, if workers disagree on the correct label for an example, then it will be more diﬃcult for a machine to classify that example. The elaborate data analysis to determine if the source of the disagreement is ambiguity supports our intuition that low clarity signals ambiguity, while high clarity sentences quite obviously express one or more of the target relations. In this talk I will share the experiences and lessons learned on the path to understanding diversity in human interpretation and the ways to capture it as ground truth to enable machines to deal with such diversity.

Data Science with Human in the Loop @Faculty of Science #Leiden University

Lora Aroyo

Software systems are becoming ever more intelligent and more useful, but the way we interact with these machines too often reveals that they don’t actually understand people. Knowledge Representation and Semantic Web focus on the scientific challenges involved in providing human knowledge in machine-readable form. However, we observe that various types of human knowledge cannot yet be captured by machines, especially when dealing with wide ranges of real-world tasks and contexts. The key scientific challenge is to provide an approach to capturing human knowledge in a way that is scalable and adequate to real-world needs. Human Computation has begun to scientifically study how human intelligence at scale can be used to methodologically improve machine-based knowledge and data management. My research is focusing on understanding human computation for improving how machine-based systems can acquire, capture and harness human knowledge and thus become even more intelligent. In this talk I will show how the CrowdTruth framework (http://crowdtruth.org) facilitates data collection, processing and analytics of human computation knowledge. Some project links: - http://controcurator.org/ - http://crowdtruth.org/ - http://diveproject.beeldengeluid.nl/ - http://vu-amsterdam-web-media-group.github.io/linkflows/

SXSW2017 @NewDutchMedia Talk: Exploration is the New Search

Lora Aroyo

Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age

Lora Aroyo

The document discusses harnessing crowds, niches, and professionals in the digital age. The key points are: - Software is becoming less important as data takes center stage; cultural institutions must know their data and crowds. - Different crowds have different expertise and abilities; nichesourcing can access specialized knowledge. - Crowdsourcing initiatives should be part of an overall strategy and integrated into existing systems. - Novel interactions and user-driven augmentations can empower users and align the digital and physical.

"Video Killed the Radio Star": From MTV to Snapchat

Lora Aroyo

The document discusses bridging the gap between people and the massive amount of online multimedia content. It proposes decomposing videos and images into smaller fragments and building a media graph to link these fragments based on semantic relationships. Both machine learning and crowdsourcing are used to analyze and enrich media with metadata at scale. The goal is to turn "mute" images and context-free videos into relationship-aware media that allows nonlinear exploration. This would provide a more engaging experience for online audiences.

More from Lora Aroyo (20)

NeurIPS2023 Keynote: The Many Faces of Responsible AI.pdf

CATS4ML Data Challenge: Crowdsourcing Adverse Test Sets for Machine Learning

Harnessing Human Semantics at Scale (updated)

Data excellence: Better data for better AI

CHIP Demonstrator presentation @ CATCH Symposium

Semantic Web Challenge: CHIP Demonstrator

The Rijksmuseum Collection as Linked Data

Keynote at International Conference of Art Libraries 2018 @Rijksmuseum

FAIRview: Responsible Video Summarization @NYCML'18

Understanding bias in video news & news filtering algorithms

StorySourcing: Telling Stories with Humans & Machines

Data Science with Humans in the Loop

Digital Humanities Benelux 2017: Keynote Lora Aroyo

DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...

Crowdsourcing ambiguity aware ground truth - collective intelligence 2017

My ESWC 2017 keynote: Disrupting the Semantic Comfort Zone

Data Science with Human in the Loop @Faculty of Science #Leiden University

SXSW2017 @NewDutchMedia Talk: Exploration is the New Search

Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age

"Video Killed the Radio Star": From MTV to Snapchat

Recently uploaded

Climate Impact of Software Testing at Nordic Testing Days

Kari Kakkonen

My slides at Nordic Testing Days 6.6.2024 Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.

Monitoring Java Application Security with JDK Tools and JFR Events

Ana-Maria Mihalceanu

UiPath Test Automation using UiPath Test Suite series, part 6

DianaGray10

Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI. UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities. Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes. What will you get from this session? 1. Insights into integrating generative AI. 2. Understanding how this integration enhances test automation within the UiPath platform 3. Practical demonstrations 4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath Topics covered: What is generative AI Test Automation with generative AI and Open AI. UiPath integration with generative AI Speaker: Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP

How to Get CNIC Information System with Paksim Ga.pptx

danishmna97

Communications Mining Series - Zero to Hero - Session 1

DianaGray10

This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered: • Communication Mining Overview • Why is it important? • How can it help today’s business and the benefits • Phases in Communication Mining • Demo on Platform overview • Q/A

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

Neo4j

Leonard Jayamohan, Partner & Generative AI Lead, Deloitte This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.

みなさんこんにちはこれ何文字まで入るの？40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの？えこ...

名前です男

TrustArc Webinar - 2024 Global Privacy Survey

TrustArc

How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024? In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores. See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe. This webinar will review: - The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey - The top challenges for privacy leaders, practitioners, and organizations in 2024 - Key themes to consider in developing and maintaining your privacy program

National Security Agency - NSA mobile device best practices

Quotidiano Piemontese

Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...

Zilliz

RESUME BUILDER APPLICATION Project for students

KAMESHS29

Presentation of the OECD Artificial Intelligence Review of Germany

innovationoecd

“I’m still / I’m still / Chaining from the Block”

Claudio Di Ciccio

GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024

Neo4j

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Neo4j

Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.

Data structures and Algorithms in Python.pdf

TIPNGVN2

Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...

James Anderson

Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management. The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM). Speakers: Bob Boule Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle. Gopinath Rebala Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.

Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!

SOFTTECHHUB

As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.

Cosa hanno in comune un mattoncino Lego e la backdoor XZ?

Speck&Tech

ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune. Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile. BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).

A tale of scale & speed: How the US Navy is enabling software delivery from l...

sonjaschweigert1

Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved: - Reduction in onboarding time from 5 weeks to 1 day - Improved developer experience and productivity through actionable findings and reduction of false positives - Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO) Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production. We will cover: - How to remove silos in DevSecOps - How to build efficient development pipeline roles and component templates - How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence) - How to streamline operations with automated policy checks on container images

Recently uploaded (20)

Climate Impact of Software Testing at Nordic Testing Days

Monitoring Java Application Security with JDK Tools and JFR Events

UiPath Test Automation using UiPath Test Suite series, part 6

How to Get CNIC Information System with Paksim Ga.pptx

Communications Mining Series - Zero to Hero - Session 1

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

TrustArc Webinar - 2024 Global Privacy Survey

National Security Agency - NSA mobile device best practices

Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...

RESUME BUILDER APPLICATION Project for students

Presentation of the OECD Artificial Intelligence Review of Germany

“I’m still / I’m still / Chaining from the Block”

GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Data structures and Algorithms in Python.pdf

Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...

Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!

Cosa hanno in comune un mattoncino Lego e la backdoor XZ?

A tale of scale & speed: How the US Navy is enabling software delivery from l...

WebSci2013 Harnessing Disagreement in Crowdsourcing

1. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo gathering gold standard annotations for relation extraction Crowd Truth Harnessing Disagreement in Crowdsourcing

2. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Gold Standard Assumption • typically in cognitive systems • for each annotated instance there is a single right answer • gold standard quality can be measured in inter-annotator agreement Let them disagree?

3. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Hypothesis Annotator disagreement is not noise, but signal. Not a problem to overcome but a source of information for machines Artificially restricting humans does not help machines to learn. They will learn better from diversity

4. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Position disagreement is a sign of intrinsic vagueness & ambiguity in human understanding

5. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Approach Principles 1.  Tolerate, capture & exploit disagreement 2.  Understand it by a space of possibilities (frequencies & similarities) 3.  Score the machine output based on where it falls in this space 4.  Adapt to new annotation tasks

6. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Relation Extraction crowdsourcing gold standard data Relations overlap in meaning Sentences are vague and ambiguous Experts have different interpretations

7. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo

8. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Feeling the way the CHEST expands (PALPATION), can identify areas of the lung that are full of ﬂuid. ?PALPATIONIs CHEST related to diagnose location associated with is_a otherpart_of 0 0 02 3 0 0 0 1 0 0 44 1 ?CONJUNCTIVITISHYPERAEMIA related toIs 0 0 0 1 0 0 0 013 0 0 0 0 0 symptomcause Redness (HYPERAEMIA), irritation (chemosis) and watering (epiphora) of the eyes are symptoms common to all forms of CONJUNCTIVITIS.

9. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo

10. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Harnessing Disagreement • Sentence-relation score: core crowd truth metric for relation extraction, measured for each relation on each sentence as the cosine of the unit vector for relation with sentence vector • Sentence clarity: for each sentence - max relation score for that sentence. If all the workers selected the same relation for a sentence, the max score is 1, indicating a clear sentence • Relation similarity: pairwise conditional probability that if relation Ri is annotated in a sentence, Rj is as well. Indicates how confusable the linguistic expression of two relations are • Relation ambiguity: max relation similarity for a relation. If a relation is clear it has low score • Relation clarity: max sentence-relation score for a relation over all sentences. If a relation has a high clarity score, it means that it is at least possible to express the relation clearly

11. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo The Dark Side of Crowdsourcing Disagreement • spammers generate disagreement for the wrong reasons • most spam detection requires gold standard • Worker-sentence disagreement: the average of all the cosines between each worker’s sentence vector and the full sentence vector (minus that worker). Indicates how much a worker disagrees with the crowd on a sentence basis • Worker-worker disagreement: a pairwise confusion matrix between workers and the average agreement across the matrix for each worker. Indicates whether there are consistently like-minded workers

12. Chris Welty Crowd Truth for Cognitive Computing Lora Aroyo Questions?

WebSci2013 Harnessing Disagreement in Crowdsourcing

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (8)

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing

Similar to WebSci2013 Harnessing Disagreement in Crowdsourcing (20)

More from Lora Aroyo

More from Lora Aroyo (20)

Recently uploaded

Recently uploaded (20)

WebSci2013 Harnessing Disagreement in Crowdsourcing