Introduction
Needs
Dimensions
Methods for Constructing Corpora
The case of Mark Twain Translations
Conclusion
Merging Crowdsourcing and Computational
Approaches for Digital Humanities
A Case of Mark Twain Translations
Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh
The 4th International Scientific Conference
Information Science in the Age of Change
Warsaw, 15th – 16th May 2017
Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh Merging Crowdsourcing and Computational Approaches for Digita
Outline
Objectives
Needs
Corpora and Resources Dimensions
Types of methods
Outline
1 Introduction
2 Needs
Which Types of Tasks
Which Types of Resources
3 Dimensions
4 Methods for Constructing Corpora
Traditional Human Creation
Crowdsourcing
Our Approach
5 The case of Mark Twain Translations
Experiment Setup
Building and Visualizing Parallel Corpora
Quality Evaluation
6 Conclusion
Tasks of Natural Language Processing
Natural Language Processing processes language material (e.g., text documents) to perform useful tasks.
NLP Tasks for Digital Humanities
Multilingual Text Analysis
Automatic Construction of Multilingual Knowledge
Lexicons
Terminologies
Ontologies
etc.
Machine Translation
Parallel Corpora
Definition
A parallel corpus is a collection of original texts in a language L1
together with their translations into a set of languages L2...Ln.
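As a rough illustration of this definition, a parallel corpus can be held as source segments paired with their translations, keyed by language code. The class name, field names, and sample sentences below are assumptions made for this sketch, not the authors' data model.

```python
from dataclasses import dataclass, field


@dataclass
class ParallelSegment:
    """One source segment in L1 with its translations into L2...Ln."""
    source_lang: str
    source_text: str
    translations: dict[str, str] = field(default_factory=dict)

    def languages(self) -> list[str]:
        # Source language first, then target languages in sorted order.
        return [self.source_lang] + sorted(self.translations)


# Hypothetical sample data, for illustration only.
segment = ParallelSegment(source_lang="en", source_text="Good morning.")
segment.translations["fr"] = "Bonjour."
segment.translations["de"] = "Guten Morgen."

print(segment.languages())
```

Keying translations by language code keeps each segment aligned with its source, so adding a new target language does not disturb the existing ones.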
Annotated Parallel Corpora
Entities
Relation markers
Linguistic level
Corpora: texts, sentences, sentence segments
Lexical: words and terms
Language
well-endowed languages
under-resourced languages
Domain
general-purpose resources
Human Construction of Language Resources
The ’traditional’ way
Writing a lexicon
Writing a thesaurus
Writing a grammar
Patterns, local grammar
Phrase-structure rules
Lexico-Syntactico-Semantic patterns or rules
Collective Human Intelligence for Language Resource Construction
Crowdsourcing is "the act of a company or institution taking a
function once performed by employees and outsourcing it to an
undefined (and generally large) network of people in the form of an
open call." [Howe, 2006]
no a priori selection of the participants ("open call")
massive (in production and participation)
(relatively) cheap
Crowdsourcing model
Previous work: [Fraisse et al., 2014]
2-Step Approach for Resource Construction
Step 1: Building an initial core of translations
Crawl open databases and online sources to collect:
the source version of a literary text and
its translations into a number of well-endowed languages
(such as French, German, or Spanish).
Step 2: Data enrichment
Incrementally extend this core to other languages through
crowdsourced data collection tasks, which should allow us to
collect translations into languages that would otherwise be
inaccessible.
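The two steps above can be sketched as a small pipeline: first keep whatever crawling returned, then add crowdsourced translations only for languages the core does not yet cover. The function names, the placeholder texts, and the rule of keeping the first contribution per language are assumptions made for this illustration; the slides do not specify how contributions are merged.

```python
def build_core(crawled: dict[str, str]) -> dict[str, str]:
    """Step 1: keep the non-empty texts collected by crawling, keyed by language."""
    return {lang: text for lang, text in crawled.items() if text.strip()}


def enrich(core: dict[str, str], contributions: list[tuple[str, str]]) -> dict[str, str]:
    """Step 2: add crowdsourced translations for languages the core lacks."""
    corpus = dict(core)
    for lang, text in contributions:
        if text.strip():
            # Assumed policy: keep the first contribution per language
            # and never overwrite a text that is already in the core.
            corpus.setdefault(lang, text)
    return corpus


# Placeholder strings stand in for full texts.
core = build_core({"en": "source text", "fr": "texte traduit", "de": ""})
corpus = enrich(core, [("vi", "bản dịch"), ("fr", "autre version")])

print(sorted(corpus))
```

Here the empty German entry is dropped in step 1, and the second French text is ignored in step 2 because French is already covered by the core.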
Mark Twain’s Adventures of Huckleberry Finn
Why?
The digitization of the writings of American author Mark
Twain (1835-1910) is already well advanced.
Adventures of Huckleberry Finn deals with transnational and
universal topics such as slavery, freedom, childhood, racism,
and coming of age; this focus, combined with the astounding
number of translations available, makes it an ideal text to use
for the prototype in an investigation of the global circulation
of a literary text.
Large portions of Twain's writings are now in the public domain.
Two Steps for Building Parallel Corpora of Mark Twain's Work
1 Collect the English source text of Adventures of Huckleberry
Finn and its translations into a number of well-endowed
languages (we started with French), using open databases
offered by national libraries or any other online source.
2 Use crowdsourcing to collect translations into languages that
would otherwise be inaccessible.
Data Collection Tasks
Translation Tasks
The Deep Maps Model
(Shelley Fisher Fishkin, 2011)
Deep Maps would embed links to archival texts and images in
nodes on an interactive map. To construct them, scholars would
mine digital archives around the world for material to include as
links, using the durable URL of the text or image in the digital
archive in which it resides, as well as additional relevant source
information (including the online citation and, if available, the
original print source of the text or image as indicated in the online
source where it is found).
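One way to picture the structure described here is a map node that carries a list of links, each recording the durable URL, the online citation, and the print source when one is known. The class names, fields, and placeholder values below are assumptions made for this sketch; they are not taken from the Deep Maps proposal or from the authors' implementation.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ArchiveLink:
    durable_url: str                    # durable URL of the item in its digital archive
    online_citation: str                # citation of the online source
    print_source: Optional[str] = None  # original print source, if available


@dataclass
class MapNode:
    place: str
    latitude: float
    longitude: float
    links: list[ArchiveLink] = field(default_factory=list)


# Placeholder values, not real archive records.
node = MapNode(place="Example City", latitude=0.0, longitude=0.0)
node.links.append(
    ArchiveLink("https://archive.example.org/item/1", "Example Archive, item 1")
)

print(len(node.links), node.links[0].print_source)
```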
Deep Maps would focus on topics that cross borders and would
include links to texts and images in different locations—sometimes
in different languages, and sometimes reflecting conflicting
interpretations of the material involved.
Deep Maps would be accessible to as broad an international public
as possible. Ideally they would be free and would be available as
pedagogical tools to any teacher or student with access to the
internet. Ideally, they would be hosted on open access university or
other non profit websites. Scholars involved in creating Deep Maps
would work with colleagues and consortiums working in this area
with technical expertise to develop user interfaces that were simple
and clean.
User interface
User Feedback
Users will be able to express their opinions about the
collected translations.
The opinions and comments they express will be automatically
analysed (an opinion mining task) to produce a first
classification of opinions by polarity into the following
four classes:
Positive (translation of good quality),
Negative (translation of bad quality),
Mixed (the translation has as many positive as negative opinions),
and
Neutral (when the given opinion is none of the above).
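The four classes can be illustrated with a small aggregation function. This is only a sketch under stated assumptions: it presumes that an upstream opinion mining step (not shown) has already labelled each comment as "positive", "negative", or something else, and the majority rule used here is one possible reading of the class definitions, not the authors' classifier.

```python
from collections import Counter


def translation_polarity(comment_labels: list[str]) -> str:
    """Aggregate per-comment labels into one of the four classes."""
    counts = Counter(comment_labels)
    pos, neg = counts["positive"], counts["negative"]
    if pos > neg:
        return "Positive"
    if neg > pos:
        return "Negative"
    if pos > 0:  # equal, non-zero numbers of positive and negative opinions
        return "Mixed"
    return "Neutral"  # no positive or negative opinions at all


print(translation_polarity(["positive", "positive", "negative"]))
print(translation_polarity(["positive", "negative"]))
```

Under this rule a translation with no comments at all is also reported as Neutral; a real system might prefer to treat that case separately.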
Conclusion and Future Work
We propose a new paradigm to assess the contribution of
crowdsourcing-based models for collection and annotation
purposes.
Setting up a generic methodology for tracking the global
circulation of any literary text
Future work:
Include other types of documents related to Mark Twain's
novel (scientific papers, studies, etc.)
Use the collected parallel corpora to extract multilingual
knowledge
Thank you!
Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh Merging Crowdsourcing and Computational Approaches for Digita

More Related Content

Viewers also liked

Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of ThingsChristopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of MoviesZhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Pablo Benalcazar: Modern Tools on Patent Thicket Identification
Pablo Benalcazar: Modern Tools on Patent Thicket IdentificationPablo Benalcazar: Modern Tools on Patent Thicket Identification
Pablo Benalcazar: Modern Tools on Patent Thicket Identification
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Nauka o informacji w XXI wieku (nowa prezentacja)
Nauka o informacji w XXI wieku (nowa prezentacja) Nauka o informacji w XXI wieku (nowa prezentacja)
Nauka o informacji w XXI wieku (nowa prezentacja) Sabina Cisek
 
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Zachowania informacyjne
Zachowania informacyjneZachowania informacyjne
Zachowania informacyjne
Sabina Cisek
 
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Mariusz Luterek: E-government as a research field
Mariusz Luterek: E-government as a research field Mariusz Luterek: E-government as a research field
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case study
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case studySamia Takhtoukh: The Practices of Historians in the Digital Age: a case study
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case study
Katedra Informatologii. Wydział Dziennikarstwa, Informacji i Bibliologii, Uniwersytet Warszawski
 

Viewers also liked (15)

Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
Karolina Zawada: Toruń University’s Open Access Data Project – the new role f...
 
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
Erika Janiūnienė, Lina Markevičiūtė: The Quality Assessment of Information Se...
 
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
Mieczysław Muraszkiewicz, Warsaw University of Technology: Artificial Intelli...
 
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of ThingsChristopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
Christopher Biedermann, EmiTel Ltd: Cybersecurity and the Internet of Things
 
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
Tibor Koltay, Eszterházy Károly University: Beyond Literacies: The evolving l...
 
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of MoviesZhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
Zhenfei Feng: The Impact of Social Influence on Users’ Ratings of Movies
 
Pablo Benalcazar: Modern Tools on Patent Thicket Identification
Pablo Benalcazar: Modern Tools on Patent Thicket IdentificationPablo Benalcazar: Modern Tools on Patent Thicket Identification
Pablo Benalcazar: Modern Tools on Patent Thicket Identification
 
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
Radosław Lipiński: Information Flow Model as an Effective Tool For Supporting...
 
Nauka o informacji w XXI wieku (nowa prezentacja)
Nauka o informacji w XXI wieku (nowa prezentacja) Nauka o informacji w XXI wieku (nowa prezentacja)
Nauka o informacji w XXI wieku (nowa prezentacja)
 
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
Maciej Dziubecki, Aleph Poland: Applying UX Principles to the Design of Libra...
 
Zachowania informacyjne
Zachowania informacyjneZachowania informacyjne
Zachowania informacyjne
 
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
 
Mariusz Luterek: E-government as a research field
Mariusz Luterek: E-government as a research field Mariusz Luterek: E-government as a research field
Mariusz Luterek: E-government as a research field
 
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
Alicja Waszkiewicz-Raviv: Visual Information and Visual Persuasion in Public ...
 
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case study
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case studySamia Takhtoukh: The Practices of Historians in the Digital Age: a case study
Samia Takhtoukh: The Practices of Historians in the Digital Age: a case study
 

Similar to Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh: Merging Crowdsourcing and Computational Approaches for Digital Humanities

An Abridged Version of My Statement of Research Interests
An Abridged Version of My Statement of Research InterestsAn Abridged Version of My Statement of Research Interests
An Abridged Version of My Statement of Research Interests
adil raja
 
Chi2006 trustworkshop
Chi2006 trustworkshopChi2006 trustworkshop
Chi2006 trustworkshop
John Thomas
 
HIM and New Way of Working
HIM and New Way of WorkingHIM and New Way of Working
HIM and New Way of Working
Pascal Ravesteijn
 
Statement of Research Interests
Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh: Merging Crowdsourcing and Computational Approaches for Digital Humanities

  • 1. Introduction Needs Dimensions Methods for Construction Corpora The case of Mark Twain Translations Conclusion Merging Crowdsourcing and Computational Approaches for Digital Humanities A Case of Mark Twain Translations Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh The 4th International Scientific Conference Information Science in the Age of Change Warsaw, 15th – 16th May 2017 Amel Fraisse, Ronald Jenn, Quoc-Tan Tran, Samia Takhtoukh Merging Crowdsourcing and Computational Approaches for Digita
  • 2. Outline
  • 3. Objectives
  • 4. Needs
  • 5. Corpora and Resources Dimensions
  • 6. Types of methods
  • 7. Outline: 1 Introduction; 2 Needs (Which Types of Tasks, Which Types of Resources); 3 Dimensions; 4 Methods for Construction Corpora (Traditional Human Creation, Crowdsourcing, Our Approach); 5 The case of Mark Twain Translations (Experiment Setup, Building and Visualizing Parallel Corpora, Quality Evaluation); 6 Conclusion
  • 8. Tasks of Natural Language Processing. Natural Language Processing processes language material (e.g., text documents) to perform useful tasks.
  • 9. NLP Tasks for Digital Humanities: multilingual text analysis; automatic construction of multilingual knowledge (lexicons, terminologies, ontologies, etc.); machine translation.
  • 10. Outline
  • 11. Parallel Corpora. Definition: a parallel corpus is a corpus that contains a collection of original texts in language L1 and their translations into a set of languages L2...Ln.
  • 12. Annotated Parallel Corpora: entities; relation markers.
  • 13. Linguistic level: corpora (texts, sentences, sentence segments); lexical (words and terms). Language: well-endowed languages; under-resourced languages. Domain: general-purpose resources.
  • 14. Outline
  • 15. Human Construction of Language Resources. The 'traditional' way: writing a lexicon; writing a thesaurus; writing a grammar; patterns, local grammar; phrase-structure rules; lexico-syntactico-semantic patterns or rules.
  • 16. Outline
  • 17. Collective Human Intelligence for Language Resources Construction. Crowdsourcing is "the act of a company or institution taking a function once performed by employees and outsourcing it to an undefined (and generally large) network of people in the form of an open call." [Howe, 2006]
  • 18. Crowdsourcing: no a priori selection of the participants ("open call")
  • 19. Crowdsourcing: massive (in production and participation)
  • 20. Crowdsourcing: (relatively) cheap
  • 21. Crowdsourcing model
  • 22. Previous work: [Fraisse et al., 2014]
  • 23. Outline
  • 24. 2-Step Approach for Resources Construction. Step 1: building the initial core of translations. Crawl open databases and online sources to collect the source version of a literary text and its translations into a number of well-endowed languages (such as French, German, or Spanish). Step 2: data enrichment. Incrementally extend this core to other languages through crowdsourced data collection tasks, which should allow us to collect translations into languages that would otherwise be inaccessible.
  • 25. Outline
  • 26. Mark Twain's Adventures of Huckleberry Finn. Why? The digitization of the writings of American author Mark Twain (1835-1910) is already well advanced. Adventures of Huckleberry Finn deals with transnational and universal topics such as slavery, freedom, childhood, racism, and coming of age; this focus, combined with the astounding number of translations available, makes it an ideal text to use for the prototype in an investigation of the global circulation of a literary text. Large portions of his writings are now in the public domain.
  • 27. Outline
  • 28. Two Steps for Building the Mark Twain Parallel Corpora. 1. Collect the English source text of Adventures of Huckleberry Finn and its translations into a number of well-endowed languages (we started with French), using open databases offered by national libraries or any other online source. 2. Use crowdsourcing to collect translations into languages that would otherwise be inaccessible.
  • 29. Data Collection Tasks
  • 30. Data Collection Tasks
  • 31. Translation Tasks
  • 32. The Deep Maps Model (Shelley Fisher Fishkin, 2011). Deep Maps would embed links to archival texts and images in nodes on an interactive map. To construct them, scholars would mine digital archives around the world for material to include as links, using the durable URL of the text or image in the digital archive in which it resides, as well as additional relevant source information (including the online citation and, if available, the original print source of the text or image as indicated in the online source where it is found).
  • 33. The Deep Maps Model (Shelley Fisher Fishkin, 2011). Deep Maps would focus on topics that cross borders and would include links to texts and images in different locations, sometimes in different languages, and sometimes reflecting conflicting interpretations of the material involved.
  • 34. The Deep Maps Model (Shelley Fisher Fishkin, 2011). Deep Maps would be accessible to as broad an international public as possible. Ideally they would be free and would be available as pedagogical tools to any teacher or student with access to the internet. Ideally, they would be hosted on open access university or other non-profit websites. Scholars involved in creating Deep Maps would work with colleagues and consortiums working in this area with technical expertise to develop user interfaces that were simple and clean.
  • 35. User interface
  • 36. Outline: 1 Introduction; 2 Needs (Which Types of Tasks, Which Types of Resources); 3 Dimensions; 4 Methods for Construction Corpora (Traditional Human Creation, Crowdsourcing, Our Approach); 5 The case of Mark Twain Translations (Experiment Setup, Building and Visualizing Parallel Corpora, Quality Evaluation); 6 Conclusion
  • 37. User Feedback: Users will be able to express their opinions about the collected translations. These opinions and comments will be analysed automatically (an opinion mining task) to produce a first polarity classification into four classes: Positive (translation of good quality), Negative (translation of bad quality), Mixed (the translation has as many positive as negative opinions) and Neutral (the opinion is none of the above).
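The four polarity classes on the User Feedback slide can be illustrated with a small sketch. This is not the authors' opinion mining system: it assumes a toy cue-word lexicon and classifies one comment at a time by counting positive and negative cues. The word lists and the function name `classify_opinion` are hypothetical.

```python
# Hypothetical toy lexicons; a real opinion mining system would use a trained model.
POSITIVE_WORDS = {"good", "accurate", "faithful", "fluent", "excellent"}
NEGATIVE_WORDS = {"bad", "wrong", "awkward", "inaccurate", "poor"}


def classify_opinion(comment: str) -> str:
    """Assign one of the four polarity classes to a single user comment."""
    # Lowercase each whitespace-separated token and strip surrounding punctuation.
    tokens = [t.strip(".,;:!?\"'()").lower() for t in comment.split()]
    positive = sum(t in POSITIVE_WORDS for t in tokens)
    negative = sum(t in NEGATIVE_WORDS for t in tokens)

    if positive == 0 and negative == 0:
        return "Neutral"   # no opinion cue found: none of the other classes
    if positive == negative:
        return "Mixed"     # as many positive as negative cues
    return "Positive" if positive > negative else "Negative"


for text in [
    "A faithful and fluent translation.",
    "Awkward, often wrong.",
    "Good vocabulary but poor rhythm.",
    "I read it last year.",
]:
    print(classify_opinion(text), "|", text)
```

The same counting rule could be applied across all comments on one translation instead of within a single comment; the slide does not say which level is intended, so the per-comment choice here is an assumption.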
  • 38. Conclusion and future work: We propose a new paradigm to assess the contribution of crowdsourcing-based models for collection and annotation purposes, and we set up a generic methodology for tracking the global circulation of any literary text. Future work: include other types of documents related to Mark Twain's novel (scientific papers, studies, etc.); use the collected parallel corpora to extract multilingual knowledge.
  • 39. Thank you!