Political risk outlook investment pack - August 2014Damian Karmelich
The Political Monitor August political risk outlook investment pack includes the Australian Political Risk Index, political risk spreads, country risk analysis for China, India and Indonesia, an examination of geo-political events on oil prices and a review of political risk across Africa.
Microorganismos multiresistentes (marsa y acinetibacter baumannii)María Belén Chacón
Dos de los microorganismos causantes de enfermedad nosocomial que más preocupan en la actualidad por la complejidad que presentan para ser tratados con antimicrobianos por su multirresintencia.
Political risk outlook investment pack - August 2014Damian Karmelich
The Political Monitor August political risk outlook investment pack includes the Australian Political Risk Index, political risk spreads, country risk analysis for China, India and Indonesia, an examination of geo-political events on oil prices and a review of political risk across Africa.
Microorganismos multiresistentes (marsa y acinetibacter baumannii)María Belén Chacón
Dos de los microorganismos causantes de enfermedad nosocomial que más preocupan en la actualidad por la complejidad que presentan para ser tratados con antimicrobianos por su multirresintencia.
¿cómo puedo consolidar un nuevo hábito en mi vida?
¿de dónde extraigo la fuerza de voluntad y el tiempo para ello?
aquí tienes 8 sencillos pasos para consolidad un hábito en tus rutinas diarias
Posicionamientos estratégicos que se pueden ocupar en la actualidad, con la llegada del Internet, la desregulación y desagregación de los mercados asi como, con la digitalización de la información, este enfoque innovador de estrategias de negocios es un giro mas o adicional a la tuerca, respecto al planteamiento de las estrategias genéricas de Michael Porter, si tienen dudas no duden en tomar contacto conmigo al EMAIL, saludos
Version longue en PDF de la présentation sur l'employabilité des traducteurs et sur la transition réussie entre formation et profession : explication des diapos et plus de 400 liens actifs vers des ressources de qualité !
Entrepreneurship als zentrales Bildungsziel. Probleme und Chancen. Ein Erfahr...entrepreneurship.ch
Eine Präsentation von Inge Faltin (3 rd solutions) an der Tagung "Kick your brain! Entrepreneurship-Education für Jugendliche in der Berufsausbildung", die am 15.11.2011 in Zürich stattfand. Diese Veranstaltung war eine Veranstaltung im Rahmen der Global Entrepreneurship Week.
There’s An App For That: Why Every Team Has Their Own Tools - Leon Adato, Sol...DevOpsDays Tel Aviv
Why do companies seem to collect monitoring solutions like pokemon cards? Because it's easier for a group to buy their own tool than to try to work with another group and share an existing system, and at the end of the day they are only going to trust the system that they "own". The good news is that fear and trust issues are two things which DevOps culture is designed to combat and overcome!
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
Software systems are becoming ever more intelligent and more useful, but the way we interact with these machines too often reveals that they don’t actually understand people. Knowledge Representation and Semantic Web focus on the scientific challenges involved in providing human knowledge in machine-readable form. However, we observe that various types of human knowledge cannot yet be captured by machines, especially when dealing with wide ranges of real-world tasks and contexts. The key scientific challenge is to provide an approach to capturing human knowledge in a way that is scalable and adequate to real-world needs. Human Computation has begun to scientifically study how human intelligence at scale can be used to methodologically improve machine-based knowledge and data management. My research is focusing on understanding human computation for improving how machine-based systems can acquire, capture and harness human knowledge and thus become even more intelligent. In this talk I will show how the CrowdTruth framework (http://crowdtruth.org) facilitates data collection, processing and analytics of human computation knowledge.
Some project links:
- http://controcurator.org/
- http://crowdtruth.org/
- http://diveproject.beeldengeluid.nl/
- http://vu-amsterdam-web-media-group.github.io/linkflows/
¿cómo puedo consolidar un nuevo hábito en mi vida?
¿de dónde extraigo la fuerza de voluntad y el tiempo para ello?
aquí tienes 8 sencillos pasos para consolidad un hábito en tus rutinas diarias
Posicionamientos estratégicos que se pueden ocupar en la actualidad, con la llegada del Internet, la desregulación y desagregación de los mercados asi como, con la digitalización de la información, este enfoque innovador de estrategias de negocios es un giro mas o adicional a la tuerca, respecto al planteamiento de las estrategias genéricas de Michael Porter, si tienen dudas no duden en tomar contacto conmigo al EMAIL, saludos
Version longue en PDF de la présentation sur l'employabilité des traducteurs et sur la transition réussie entre formation et profession : explication des diapos et plus de 400 liens actifs vers des ressources de qualité !
Entrepreneurship als zentrales Bildungsziel. Probleme und Chancen. Ein Erfahr...entrepreneurship.ch
Eine Präsentation von Inge Faltin (3 rd solutions) an der Tagung "Kick your brain! Entrepreneurship-Education für Jugendliche in der Berufsausbildung", die am 15.11.2011 in Zürich stattfand. Diese Veranstaltung war eine Veranstaltung im Rahmen der Global Entrepreneurship Week.
There’s An App For That: Why Every Team Has Their Own Tools - Leon Adato, Sol...DevOpsDays Tel Aviv
Why do companies seem to collect monitoring solutions like pokemon cards? Because it's easier for a group to buy their own tool than to try to work with another group and share an existing system, and at the end of the day they are only going to trust the system that they "own". The good news is that fear and trust issues are two things which DevOps culture is designed to combat and overcome!
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
Software systems are becoming ever more intelligent and more useful, but the way we interact with these machines too often reveals that they don’t actually understand people. Knowledge Representation and Semantic Web focus on the scientific challenges involved in providing human knowledge in machine-readable form. However, we observe that various types of human knowledge cannot yet be captured by machines, especially when dealing with wide ranges of real-world tasks and contexts. The key scientific challenge is to provide an approach to capturing human knowledge in a way that is scalable and adequate to real-world needs. Human Computation has begun to scientifically study how human intelligence at scale can be used to methodologically improve machine-based knowledge and data management. My research is focusing on understanding human computation for improving how machine-based systems can acquire, capture and harness human knowledge and thus become even more intelligent. In this talk I will show how the CrowdTruth framework (http://crowdtruth.org) facilitates data collection, processing and analytics of human computation knowledge.
Some project links:
- http://controcurator.org/
- http://crowdtruth.org/
- http://diveproject.beeldengeluid.nl/
- http://vu-amsterdam-web-media-group.github.io/linkflows/
A conference on how to engage the publics of sociotechnical controversies in the effort of controversy mapping.
I have been invited to give this conference at the 2012 4S conference on Science and Technology Studies (Copenhague - 18/10/12), at the 'Tactics of Issue Mapping' seminar of Goldsmith University (London - 26/10/12), at the Department of Media Studies of the University of Amsterdam (17/04/13) and at the Ecsite Conference on science centres and museums (Gothenburg - 08/06/13).
In the first weeks of this class, we have discussed the concept MalikPinckney86
In the first weeks of this class, we have discussed the concept of the history of sexuality. Write a short critical response paper (500+ words) in which you explain the problems inherent in defining and identifying sexual identity (as it relates to the category “homosexual,”) OR sex/gender identity (“intersex”), particularly in light of the essentialism/social construction debate or, colloquially, nature vs. nurture.
Whose position do you find the most compelling, and what are the political implications of these theories? Do you prefer Foucault or Jagose?
Your essay is an opportunity for you to reflect on one or more key concepts through a close critical reading of one text, or a comparison of two. You may wish to pursue a theoretical question in more detail in your critical response paper. Make sure you tell me which text you are choosing. You can pick any of the ones from the modules we've covered (including recommended texts). Again, you can choose one to pick apart. For instance, I can spend time describing an articles and compare them to one another, describing how similar or different they are in how they deal with their respective subjects.
Your essay should have a thesis and you should develop at least 3 main sub-points. In-text citations are necessary as well as a works cited listing the source you used. Any citation style is fine as long as it is consistent.
It is not necessary to comprehensively critique every perspective. If you absolutely have NO IDEA what to talk about, I have provided a list. you are, however, NOT BOUND BY THIS LIST! You do not have to pick one of these items, you are absolutely free to explore your own topic. You also do not have to cover everything on this list. These are just thesissuggestions.
PossibleQuestions/Topics to Address:
· The “birth of the homosexual” – when? What defines a homosexual?
· Medical discourse and the pathologization of non-heterosexual (queer) sexual practices/identities
· Medical discourse and the pathologization of non-normatively sexed bodies
· Distinction between sexual practice and sexual identity – the cultural construction & meaning of sexuality, from ancient Greece to the present day
· Capitalism and its impact on sexuality
· Sexuality and social construction – are “homosexuals” born or made?
CASE STUDY ASSIGNMENT: Albaker architects
Select an architecture firm that you would be interested in (Albaker Architects) working for
and prepare a case study of that firm. Consider factors such as location (where do you want
to live after graduation?), firm size, specialty, and type of projects. This is an illustrated
essay, no more than three pages in length (800 - 1000 words). Use graphics and diagrams
to tell the story of the firm. The Firm Details and Analysis questions listed below are
suggestions to stimulate your research. A strong case study will accurately reflect the firm.
Imagine if you sent it to a member of the firm, they would see a thoughtful, well ...
Towards a Framework for XR Ethics - Kent Bye, AWE, November 11, 2021Kent Bye
For all the ways that immersive technologies can be used for good, they can be used for evil. This talk will provide some conceptual frames for making sense of the landscape of XR ethical dilemmas including human rights principles, tradeoffs between contextual dimensions, and mapping relationships between techno-social, political, and economic domains. This talk will be reporting back on some of the work done by the IEEE Global Initiative on the Ethics of Extended Reality, as well as provide insights into how to integrate ethically-aligned design and responsible innovation best practices into your experiential design process.
The Rijksmuseum Collection as Linked DataLora Aroyo
Presentation at ISWC2018: http://iswc2018.semanticweb.org/sessions/the-rijksmuseum-collection-as-linked-data/ of our paper published originally in the Semantic Web Journal: http://www.semantic-web-journal.net/content/rijksmuseum-collection-linked-data-2
Many museums are currently providing online access to their collections. The state of the art research in the last decade shows that it is beneficial for institutions to provide their datasets as Linked Data in order to achieve easy cross-referencing, interlinking and integration. In this paper, we present the Rijksmuseum linked dataset (accessible at http://datahub.io/dataset/rijksmuseum), along with collection and vocabulary statistics, as well as lessons learned from the process of converting the collection to Linked Data. The version of March 2016 contains over 350,000 objects, including detailed descriptions and high-quality images released under a public domain license.
FAIRview: Responsible Video Summarization @NYCML'18Lora Aroyo
Presentation at the NYC Media Lab (NYCML2018). There is a growing demand for news videos online, with more consumers preferring to watch the news than read or listen to it. On the publisher side, there is a growing effort to use video summarization technology in order to create easy-to-consume previews (trailers) for different types of broadcast programs. How can we measure the quality of video summaries and their potential to misinform? This workshop will inform participants about automatic video summarization algorithms and how to produce more “representative” video summaries. The research presented is from the FAIRview project and is supported by the Digital News Innovation Fund (DNI Fund), which is part of the Google News Initiative.
DH Benelux 2017 Panel: A Pragmatic Approach to Understanding and Utilising Ev...Lora Aroyo
Lora Aroyo, Chiel van den Akker, Marnix van Berchum, Lodewijk
Petram, Gerard Kuys, Tommaso Caselli, Jacco van Ossenbruggen, Victor de Boer, Sabrina Sauer, Berber Hagedoorn
Crowdsourcing ambiguity aware ground truth - collective intelligence 2017Lora Aroyo
The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to the volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, this assumption often creates issues in practice. Previous experiments we performed found that inter-annotator disagreement is usually never captured, either because the number of annotators is too small to capture the full diversity of opinion, or because the crowd data is aggregated with metrics that enforce consensus, such as majority vote. These practices create artificial data that is neither general nor reflects the ambiguity inherent in the data.
To address these issues, we proposed the method for crowdsourcing ground truth by harnessing inter-annotator disagreement. We present an alternative approach for crowdsourcing ground truth data that, instead of enforcing an agreement between annotators, captures the ambiguity inherent in semantic annotation through the use of disagreement-aware metrics for aggregating crowdsourcing responses. Based on this principle, we have implemented the CrowdTruth framework for machine-human computation, that first introduced the disagreement-aware metrics and built a pipeline to process crowdsourcing data with these metrics.
In this paper, we apply the CrowdTruth methodology to collect data over a set of diverse tasks: medical relation extraction, Twitter event identification, news event extraction and sound interpretation. We prove that capturing disagreement is essential for acquiring a high-quality ground truth. We achieve this by comparing the quality of the data aggregated with CrowdTruth metrics with a majority vote, a method which enforces consensus among annotators. By applying our analysis over a set of diverse tasks we show that, even though ambiguity manifests differently depending on the task, our theory of inter-annotator disagreement as a property of ambiguity is generalizable.
My ESWC 2017 keynote: Disrupting the Semantic Comfort ZoneLora Aroyo
Ambiguity in interpreting signs is not a new idea, yet the vast majority of research in machine interpretation of signals such as speech, language, images, video, audio, etc., tend to ignore ambiguity. This is evidenced by the fact that metrics for quality of machine understanding rely on a ground truth, in which each instance (a sentence, a photo, a sound clip, etc) is assigned a discrete label, or set of labels, and the machine’s prediction for that instance is compared to the label to determine if it is correct. This determination yields the familiar precision, recall, accuracy, and f-measure metrics, but clearly presupposes that this determination can be made. CrowdTruth is a form of collective intelligence based on a vector representation that accommodates diverse interpretation perspectives and encourages human annotators to disagree with each other, in order to expose latent elements such as ambiguity and worker quality. In other words, CrowdTruth assumes that when annotators disagree on how to label an example, it is because the example is ambiguous, the worker isn’t doing the right thing, or the task itself is not clear. In previous work on CrowdTruth, the focus was on how the disagreement signals from low quality workers and from unclear tasks can be isolated. Recently, we observed that disagreement can also signal ambiguity. The basic hypothesis is that, if workers disagree on the correct label for an example, then it will be more difficult for a machine to classify that example. The elaborate data analysis to determine if the source of the disagreement is ambiguity supports our intuition that low clarity signals ambiguity, while high clarity sentences quite obviously express one or more of the target relations. In this talk I will share the experiences and lessons learned on the path to understanding diversity in human interpretation and the ways to capture it as ground truth to enable machines to deal with such diversity.
Crowdsourcing & Nichesourcing: Enriching Cultural Heritagewith Experts & Cr...Lora Aroyo
Presentation at the "Past, Present and Future of Digital Humanities & Social Sciences in the Netherlands" event, http://www.ehumanities.nl/past-present-and-future-of-digital-humanities-social-sciences-in-the-netherlands-programme-and-abstracts-2/
Stitch by Stitch: Annotating Fashion at the RijksmuseumLora Aroyo
https://www.rijksmuseum.nl/en/stitch-by-stitch
http://annotate.accurator.nl/
Fashion can be found everywhere in museums. Fashion heritage collected over centuries: costumes, accessories, paintings, prints and photographs. But while some clothes and accessories are easily found and identified, others are obscure and require a trained eye to describe. What are we looking at? What kind of sleeve is this? Which materials and techniques have been used? More specific descriptions of the images facilitate better use of digital collections and enable users to wander through them in detail.
Museums & the Web 2016 Presentation: Enriching Collections with Expert Knowle...Lora Aroyo
http://mw2016.museumsandtheweb.com/proposal/accurator-enriching-collections-with-expert-knowledge-from-the-crowd/
Crowdsourcing is not a new phenomenon for museums. There are good examples for museums (e.g., Powerhouse museum, steve.museum). But not all crowdsourcing initiatives are successful. Crowdsourced tagging does not always contribute to a better understanding of art and can even be confusing.
The Rijksmuseum and the VU University Amsterdam developed the Accurator: a visual tool to get experts in domains like birds, bibles, ships, castles, etc. involved in annotating art and enrich the museums’ metadata with expertise that is not available internally.
In this how-to session, we demonstrate the tool and the ways other museums can implement this Open Web application for their own collections.
Achieving Expert-Level Annotation Quality with CrowdTruth: The Case of Medical Relation Extraction. Anca Dumitrache, Lora Aroyo and Chris Welty. ==> http://ceur-ws.org/Vol-1428/
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
2. “The
Gallery
of
Cornelis
van
der
Geest”
(Willem
van
Haecht)
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
3. hun<ng
vs.
dogs
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
4. events
vs.
people
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
5. DIGITAL
HERMENEUTICS
theory of interpretation: relation parts of wholes
events as context for interpretation of online collections
intersection of hermeneutics & Web technology
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
6. Linking to Events
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
8. So
far
so
good,
but
….
L.
Aroyo,
C.
Welty:
Truth
is
a
Lie:
7
Myths
about
Human
Annota;on,
AI
Magazine
2014
(in
press).
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
9. Events
are
Vague
people have no clear notion of what events are
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
10. Events
have
Perspec+ves
and people don’t always agree
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
11. “Event can refer to many things such as: An observable
occurrence, phenomenon or an extraordinary occurrence.”
“an event is an incident that's very important or monumental”
“A planned public or social get together or occasion.”
“An event is something occurring at a specific time and/or
date to celebrate or recognize a particular occurrence.”
“a location where something like a function is held. you could tell
if something is an event if there people gathering for a purpose.”
If you ask the crowd ...
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
12. “an
event
is
the
exemplifica;on
of
a
property
by
a
substance
at
a
given
;me” Jaegwon
Kim,
1966
“events
are
changes
that
physical
objects
undergo”
Lawrence
Lombard,
1981
“events
are
proper;es
of
spa;otemporal
regions”,
David
Lewis,
1986
under30ceo.com
If you ask the experts ...
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
13. under30ceo.com
People are the ones who search
& determine relevance
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
14. Experts
vs.
Crowd?
Medical Relation Extraction Task (Aroyo, Welty 2014)
• 91% of expert annotations covered by the crowd
• expert annotators agree only in 30%
• popular crowd vote covers 95% of expert agreement
Waisda? Video Tagging (Gligorov et al. 2011)
• 14% tags in search logs are in professional vocab (GTAA)
• huge gap between expert & lay users’ views on what’s
important
Steve.Museum Project (Leason 2009)
• 14% user tags are in expert-curated documentation
under30ceo.com
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
15. CrowdTruth
Based on annotator disagreement as an indication of
the variation in human semantic interpretation of
signs, and can indicate ambiguity, vagueness, over-generality,
etc.
crowdtruth.org
L. Aroyo, C. Welty: Crowd Truth: Harnessing disagreement in crowdsourcing a relation extraction gold standard. ACM WebSci 2013.
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
16. CrowdTruth Framework for News Event Extraction
O.Inel, K.Khamkham, T.Cristea, A.Rutjes, J.van der Ploeg, L.Aroyo, R. Sips, A.Dumitrache, L.Romaszko: CrowdTruth: Machine-
Human Computation Framework for Harnessing Disagreement in Gathering Annotated Data. ISWC 2014.
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
17. News Event Extraction
The police came to Apple’s glass cube on Fifth Avenue
on Tuesday to enforce order after activists released
black balloons inside the cube to [protest] the
company’s environmental policies.
The police came to Apple’s glass cube on Fifth Avenue
on Tuesday [to enforce] order after activists released
black balloons inside the cube to protest the company’s
environmental policies.
The police came to Apple’s glass cube on Fifth Avenue
on Tuesday [to enforce order] after activists released
black balloons inside the cube to protest the company’s
environmental policies.
The police came to Apple’s glass cube on Fifth Avenue
on Tuesday to enforce order after activists [released]
black [balloons] inside the cube to protest the
company’s environmental policies.
The police [came]to Apple’s glass cube on Fifth Avenue
on Tuesday [to enforce] order after activists released
black balloons inside the cube to protest the company’s
environmental policies.
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
18. Video Event Extraction
Following the grandeur of Baroque, Rococo
art is often dismissed as frivolous and
unserious, but Waldemar Januszczak
disagrees. […] The first episode is about
travel in the 18th century and how it
impacted greatly on some of the finest art
ever made. The world was getting smaller and
took on new influences shown in the glorious
Bavarian pilgrimage architecture,
Canaletto's romantic Venice and the
blossoming of exotic designs and tastes all
over Europe.
Rococo: Travel, pleasure, madness
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
19. Events have multiple DIMENSIONS
Micro-task Template
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
20. Each DIMENSION has different GRANULARITY
Micro-task Template
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
21. People have different POINTS OF VIEWS
Micro-task Template
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
22. Triangle of Reference
Sign
Reference Observer
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
23. Triangle of Reference
to Capture Disagreement
Sentence
Annotation Task Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
24. CrowdTruth Metrics
Event Extraction
Three parts to understand human interpretations:
Sentence
How good is a sentence for the event extraction task?
Workers
How well does a worker understand the sentence?
Relations
Is the meaning of the event type clear?
How ambiguous/confusable is it?
Aroyo, L., Welty, C.: (2014) The Three Sides of CrowdTruth. Journal of Human Computation
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
25. Crowd Truth Metrics
based on the Triangle of Reference
Three parts to understand human interpretations:
Sign
How good is a sign for conveying information?
People
How well does a person understand the sign?
Ontology
Are the distinctions of the ontology clear?
How ambiguous/confusable are they?
Aroyo, L., Welty, C.: (2014) The Three Sides of CrowdTruth. Journal of Human Computation
Lora Aroyo Crowd Truth for Cognitive Computing Chris Welty
26. Disagreement Analytics
• sentence metrics: sentence clarity, sentence-relation score
• annotation task metrics: event clarity, type similarity, relation
ambiguity
• worker metrics:
o worker-sentence disagreement
o worker-worker disagreement
o avg number of annotations per sentence
o valid words in explanation text
o same explanation across contributions
o “[OTHER]” + different type
o time to complete, number of sentences, etc.
Aroyo, L., Welty, C.: (2013) Measuring crowd truth for medical relation extraction. AAAI Fall Symposium on Semantics for Big Data
http://www.americanprogress.org/wp-content/uploads/2012/12/multiple_http://lora-aroyo.org http://slideshare.net/laroyo @laroyo measures_onpage.jpg
27. Spam Detection
o filter sentences on their clarity score: to avoid penalizing workers
for contributing on ambiguous sentences
o bad sentences are removed = increase of accuracy on spam
detection
o apply worker metrics to analyze worker agreement: workers
who systematically disagree
o with majority (worker-sentence disagreement)
o with rest of co-workers (worker-worker disagreement)
o spammers annotations are removed = improvement of accuracy of
sentence metrics
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
28. Annotation Example
Around 2:30 p.m., as if delivering birthday greetings, several Greenpeace
demonstrators [ENTERED] the cube clutching helium-filled balloons, which
were the shape and color of charcoal briquettes.
Overall annotation & granularity distribution:
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
29. Event Type Disagreement
Around 2:30 p.m., as if delivering birthday greetings, several Greenpeace
demonstrators [ENTERED] the cube clutching helium-filled balloons, which
were the shape and color of charcoal briquettes.
[ENTERED]
ACTION (18.2%)
MOTION (9.1%)
ARRIVING_OR_
DEPARTING (54.5%)
PURPOSE (18.2%)
type
type
Sentence
type type
Ontology Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
30. Event Location Disagreement
Around 2:30 p.m., as if delivering birthday greetings, several Greenpeace
demonstrators [ENTERED] the cube clutching helium-filled balloons, which
were the shape and color of charcoal briquettes.
[ENTERED]
the cube
(38.5%)
cube
(38.5%)
none
(23%)
OTHER (100%)
NOT
APPLICABLE
(100%)
type
type
COMMERCIAL
(40%)
OTHER (40%)
INDUSTRIAL
(20%)
type
type
type
Sentence
Ontology Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
31. Around 2:30 p.m., as if delivering birthday greetings, several Greenpeace
demonstrators [ENTERED] the cube clutching helium-filled balloons, which
were the shape and color of charcoal briquettes.
[ENTERED]
Event Time Disagreement
Around
(9.1%)
Around
2:30 p.m.
(45.45%)
2:30 p.m.
(45.45%)
TIMESTAM
P (100%)
TIMESTAM
P (100%)
type
type
TIMESTAM
P (100%)
type
Sentence
Ontology Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
32. Event Participant Disagreement
Around 2:30 p.m., as if delivering birthday greetings, several Greenpeace
demonstrators [ENTERED] the cube clutching helium-filled balloons, which
were the shape and color of charcoal briquettes.
[ENTERED]
Greenpeace
(15.39%)
demonstrators
(15.39%)
Greenpeace
demonstrators
(69.23%)
ORGANIZATION
(100%)
PERSON (100%)
type
type
ORGANIZATION
(77.77%)
PERSON
(22.22%)
type
type
Sentence
Ontology Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
33. Comparative Annotation Distribution
Event Type Distribution Time Type Distribution
Sentence
Ontology Worker
The high disagreement for event type across all sentences likely indicates problems with the ontology. These
event types are difficult to distinguish between. The event classes may overlap, be confusable, too vague, etc.
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
34. Comparative Annotation Distribution
Location Type Distribution Participant Type Distribution
Sentence
Ontology Worker
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
35. Challenges
● Defining relevance, e.g. relevant
or related events, entities, videos
● Depicted vs. associated
relevance, e.g. in video, in audio
● Deal with reliability, e.g.
provenance
● Visualize quality analytics, e.g.
multidimensionality
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
36. Events Cultural Heritage Exploration
http://dive.beeldengeluid.nl/
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
37. Events @
• Agora: Historical Events in Cultural Heritage Collections
– http://agora.cs.vu.nl/
• Extractivism: Activist Events in Newspapers
– http://mona-project.org/
• Semantics of History
– http://www2.let.vu.nl/oz/cltl/semhis/
• BiographyNet: Events Change in Perspective over Time
• NewsReader: Multilingual Events & Storylines in Newspapers
– http://www.newsreader-project.eu/
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
38. Conclusions
● Events are just one example for diversity of human
interpretations
● Understanding crowd disagreement helps understand event
semantics
● Considering the interdependence of the different aspects of
the annotations improves their quality
● Disagreement metrics adaptable across domains - helped
us to understand the vagueness and the clarity of a sentence/
putative event
http://lora-aroyo.org http://slideshare.net/laroyo @laroyo
Sentence
Annotation task Worker