SlideShare a Scribd company logo
1 of 16
Download to read offline
News networks in XVII century Italy
Giovanni Colavizza EPFL, Mario Infelise Ca’ Foscari
Subject: the European news ïŹ‚ow
Hypothesis: 1 system of news exchange through Europe.
Raise in demand during 30y War, regular postal service.
Key traits of this information system:
‱ multi-media (handwritten long and short range, more
ïŹ‚exible on demand; printed short range and broader
public)
‱ adaptive “hub and spoke” network
‱ multi-language
Our questions and general aims
How to:
1. prove the existence and extent of the ïŹ‚ow
2. reconstruct its ïŹne-grained dynamic cartography
3. study the problem of information supply and
exchange: media interactions
Basic approach: detect text reuse.
We start by developing robust methods for this end.
How gazettes look
Sources (year 1648)
Asti
Cartagena
Francia
Catalogna
Provenza
Livorno
Alicante
Casale
Parma
Bruxelles
Avignone
Colonia
Palermo
Riviera diPonente
Madrid
Marsiglia
Inghilterra
Lione
Torino
Napoli
Lisbona
Roma
Londra Germania
Milano
Genova
Barcellona
Parigi
Venezia
Bologna
Francia
Svezia
Augusta
Palatinato
Costantinopoli
Monaco
Erfurt
Norimberga
Londra
Franconia
Cassel
Venezia
Vienna
Svevia
Munster
Ratisbona
Amburgo
Francoforte Praga
Colonia
Printed gazettes:
Turin and Genoa
Handwritten: from
Vatican Archives,
Segreteria di
Stato, Avvisi.
Methods: data preparation - printed
Results: editorial policies (printed gazettes)
Most frequent sequence order of printed news in each issue:
‱ Genoa: Genoa, Rome/Naples/Marseille, Milan, Lisbon, Barcelona, Paris, London, Germany and Venice.
‱ Turin: (i1) Turin, Barcelona, Paris, London, Germany; (i2) Milan, Genoa, Naples, Rome and Venice.
Statistic Genoa Turin
Total character
count
281206 579381
Total number of
paragraphs
263 1221
Average
characters per
paragraph
1069 474
Results: editorial policies (printed gazettes)
Sheet1
1 2 3 4 5 6
0
2000
4000
6000
8000
10000
12000
14000
16000
Average text per issue Turin
Genoa
Month
Charcount
Sheet1
1 2 3 4 5 6
0
200
400
600
800
1000
1200
Average text per item Turin
Genoa
Month
Charcount
2000
4000
6000
8000
10000
12000
14000
16000
Average text per issue Turin
Genoa
Charcount
Methods: matching algorithms - printed
Strategy: compare paragraphs (units of formatting/
reading but also meaning)
Global match: SubString Kernels (similarity of sequences
of non-contiguous characters)
Local alignment: Smith-Waterman (ïŹnds local matching
passages)
Threshold ïŹltering and manual evaluation of 2 highest
scoring matches
Results: the ïŹ‚ow (printed gazettes)
Turin
Paris
Barcelona
Lisbon
Milan Venice
London
Naples
Rome
Genoa
Germany
Results: comparisons (printed gazettes)
Categories:
1. verbatim copy of a whole paragraph or parts of it
2. paraphrasing or translations of the same source
3. same news from different sources
4. same topic but different news
Results:
1 and 3 <1%
2 circa 30%
4 circa 43%
Evaluation:
precision by hand
recall “intractable”
Methods: data preparation - handwritten
Plenipotentiario di Spagna (keyword)
Re di Spagna (name_of_person)
Conte d'AvĂČ (name_of_person)
spagnoli (quantity)
Ambasciatore di Portogallo (keyword)
Perera (name_of_person)
Hassi (keyword)
Cassel (name_of_place)
Plenipotentiario di Franza (keyword)
Sua MaestĂ  Cesarea (name_of_person)
Landgraviessa d'Assia (name_of_person)
Osnapruch (name_of_place)
trattato dell'Imperio (keyword)
Lantgravio di Darmstat (name_of_person)
Amnistia nello stati hereditarij (keyword)
anni (quantity)
Pinorada (name_of_person)
Svedesi (keyword)
Provincia d'Utrecht (name_of_place)
pace (keyword)
Spagna (name_of_place)
Olanda (name_of_place)
Zelanda (name_of_place)
Provinzie Basse (name_of_place)
Francia (name_of_place)
Methods: matching algorithms - handwritten
Strategy: compare paragraphs
Typed canonicalisation: similar words are grouped into
typed categories (Jaro-Winkler distance)
Paragraph comparison: Tf-idf vectors, cosine distance
Manual evaluation of 2 highest scoring matches
Too limited and skewed corpus for now..
Results: matchings (handwritten)
Munster 24 April 1648:
Cologne 19
April 1648:
High score, same
topic, different news.
Different news-sheets
Open questions
1. How to effectively evaluate results? The open question
of scalable recall and precision
2. How to get a larger corpus (e.g. at least 2 years to
study seasonality)? 1) lack of data 2) cost of data
preparation
3. How to compare printed and handwritten news?
Ongoing work
4. What to focus on? Variations are as interesting as
verbatim copies to study the interaction of different
medias and types of gazettes..
News networks in XVII century Italy
Thanks
Giovanni Colavizza EPFL, Mario Infelise Ca’ Foscari

More Related Content

Similar to Mapping the News Networks in XVII Italy

· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
gerardkortney
 
Assignment 1 Reviewing Research and Making Connect.docx
Assignment 1 Reviewing Research and Making Connect.docxAssignment 1 Reviewing Research and Making Connect.docx
Assignment 1 Reviewing Research and Making Connect.docx
deanmtaylor1545
 

Similar to Mapping the News Networks in XVII Italy (13)

Allusion And Ambiguity In Seamus Heaney S Quot Blackberry-Picking
Allusion And Ambiguity In Seamus Heaney S  Quot Blackberry-PickingAllusion And Ambiguity In Seamus Heaney S  Quot Blackberry-Picking
Allusion And Ambiguity In Seamus Heaney S Quot Blackberry-Picking
 
· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
· Coronel & Morris Chapter 7, Problems 1, 2 and 3.docx
 
European or Imperial Metropolis? Depictions of London in British Newspapers, ...
European or Imperial Metropolis? Depictions of London in British Newspapers, ...European or Imperial Metropolis? Depictions of London in British Newspapers, ...
European or Imperial Metropolis? Depictions of London in British Newspapers, ...
 
Bibliotheca Digitalis Summer school: Prosopographical data and Cultural netwo...
Bibliotheca Digitalis Summer school: Prosopographical data and Cultural netwo...Bibliotheca Digitalis Summer school: Prosopographical data and Cultural netwo...
Bibliotheca Digitalis Summer school: Prosopographical data and Cultural netwo...
 
Assignment 1 Reviewing Research and Making Connect.docx
Assignment 1 Reviewing Research and Making Connect.docxAssignment 1 Reviewing Research and Making Connect.docx
Assignment 1 Reviewing Research and Making Connect.docx
 
Crisis of scientific communication; fact or fiction? (Rudolf Haƈka)
Crisis of scientific communication; fact or fiction? (Rudolf Haƈka)Crisis of scientific communication; fact or fiction? (Rudolf Haƈka)
Crisis of scientific communication; fact or fiction? (Rudolf Haƈka)
 
17th Century Mathematics
17th Century Mathematics17th Century Mathematics
17th Century Mathematics
 
Tagung Staatspersonifikationen Programm (MĂ€rz 2016)
Tagung Staatspersonifikationen Programm (MĂ€rz 2016)Tagung Staatspersonifikationen Programm (MĂ€rz 2016)
Tagung Staatspersonifikationen Programm (MĂ€rz 2016)
 
MacroMicroZoom.pdf
MacroMicroZoom.pdfMacroMicroZoom.pdf
MacroMicroZoom.pdf
 
Commemorating the Great War on Twitter
Commemorating the Great War on TwitterCommemorating the Great War on Twitter
Commemorating the Great War on Twitter
 
(Oxford World's Classics) René Descartes, Ian Maclean - Discourse Method of C...
(Oxford World's Classics) René Descartes, Ian Maclean - Discourse Method of C...(Oxford World's Classics) René Descartes, Ian Maclean - Discourse Method of C...
(Oxford World's Classics) René Descartes, Ian Maclean - Discourse Method of C...
 
Data versus Text: 30 years of confrontation
Data versus Text: 30 years of confrontationData versus Text: 30 years of confrontation
Data versus Text: 30 years of confrontation
 
2004. Modernism And Its Metaphors Rereading The Voyage Out By Virginia Wo...
2004.  Modernism And Its Metaphors  Rereading  The Voyage Out  By Virginia Wo...2004.  Modernism And Its Metaphors  Rereading  The Voyage Out  By Virginia Wo...
2004. Modernism And Its Metaphors Rereading The Voyage Out By Virginia Wo...
 

More from Giovanni Colavizza

Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013
Giovanni Colavizza
 

More from Giovanni Colavizza (8)

Sul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital HumanitiesSul ruolo dell’umanista nelle Digital Humanities
Sul ruolo dell’umanista nelle Digital Humanities
 
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
La Venice Time Machine e alcune sfide dei progetti “Big Science” nelle discip...
 
The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...The References of References: Enriching Library Catalogs via Domain-Specific ...
The References of References: Enriching Library Catalogs via Domain-Specific ...
 
Notes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliensNotes de bas de page: d’un outil savant aux hyperliens
Notes de bas de page: d’un outil savant aux hyperliens
 
Introduction to the Venice Time Machine
Introduction to the Venice Time MachineIntroduction to the Venice Time Machine
Introduction to the Venice Time Machine
 
Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014Linked Books - DH Venice Fall School 2014
Linked Books - DH Venice Fall School 2014
 
Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013Leipzig Functional Categorisation 11/12/2013
Leipzig Functional Categorisation 11/12/2013
 
Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013Venezia Biblioteche e Digital Humanities 28/10/2013
Venezia Biblioteche e Digital Humanities 28/10/2013
 

Recently uploaded

Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
SĂ©rgio Sacani
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
Cherry
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
SĂ©rgio Sacani
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
levieagacer
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Cherry
 

Recently uploaded (20)

Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfConcept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 

Mapping the News Networks in XVII Italy

  • 1. News networks in XVII century Italy Giovanni Colavizza EPFL, Mario Infelise Ca’ Foscari
  • 2. Subject: the European news ïŹ‚ow Hypothesis: 1 system of news exchange through Europe. Raise in demand during 30y War, regular postal service. Key traits of this information system: ‱ multi-media (handwritten long and short range, more ïŹ‚exible on demand; printed short range and broader public) ‱ adaptive “hub and spoke” network ‱ multi-language
  • 3. Our questions and general aims How to: 1. prove the existence and extent of the ïŹ‚ow 2. reconstruct its ïŹne-grained dynamic cartography 3. study the problem of information supply and exchange: media interactions Basic approach: detect text reuse. We start by developing robust methods for this end.
  • 5. Sources (year 1648) Asti Cartagena Francia Catalogna Provenza Livorno Alicante Casale Parma Bruxelles Avignone Colonia Palermo Riviera diPonente Madrid Marsiglia Inghilterra Lione Torino Napoli Lisbona Roma Londra Germania Milano Genova Barcellona Parigi Venezia Bologna Francia Svezia Augusta Palatinato Costantinopoli Monaco Erfurt Norimberga Londra Franconia Cassel Venezia Vienna Svevia Munster Ratisbona Amburgo Francoforte Praga Colonia Printed gazettes: Turin and Genoa Handwritten: from Vatican Archives, Segreteria di Stato, Avvisi.
  • 7. Results: editorial policies (printed gazettes) Most frequent sequence order of printed news in each issue: ‱ Genoa: Genoa, Rome/Naples/Marseille, Milan, Lisbon, Barcelona, Paris, London, Germany and Venice. ‱ Turin: (i1) Turin, Barcelona, Paris, London, Germany; (i2) Milan, Genoa, Naples, Rome and Venice. Statistic Genoa Turin Total character count 281206 579381 Total number of paragraphs 263 1221 Average characters per paragraph 1069 474
  • 8. Results: editorial policies (printed gazettes) Sheet1 1 2 3 4 5 6 0 2000 4000 6000 8000 10000 12000 14000 16000 Average text per issue Turin Genoa Month Charcount Sheet1 1 2 3 4 5 6 0 200 400 600 800 1000 1200 Average text per item Turin Genoa Month Charcount 2000 4000 6000 8000 10000 12000 14000 16000 Average text per issue Turin Genoa Charcount
  • 9. Methods: matching algorithms - printed Strategy: compare paragraphs (units of formatting/ reading but also meaning) Global match: SubString Kernels (similarity of sequences of non-contiguous characters) Local alignment: Smith-Waterman (ïŹnds local matching passages) Threshold ïŹltering and manual evaluation of 2 highest scoring matches
  • 10. Results: the ïŹ‚ow (printed gazettes) Turin Paris Barcelona Lisbon Milan Venice London Naples Rome Genoa Germany
  • 11. Results: comparisons (printed gazettes) Categories: 1. verbatim copy of a whole paragraph or parts of it 2. paraphrasing or translations of the same source 3. same news from different sources 4. same topic but different news Results: 1 and 3 <1% 2 circa 30% 4 circa 43% Evaluation: precision by hand recall “intractable”
  • 12. Methods: data preparation - handwritten Plenipotentiario di Spagna (keyword) Re di Spagna (name_of_person) Conte d'AvĂČ (name_of_person) spagnoli (quantity) Ambasciatore di Portogallo (keyword) Perera (name_of_person) Hassi (keyword) Cassel (name_of_place) Plenipotentiario di Franza (keyword) Sua MaestĂ  Cesarea (name_of_person) Landgraviessa d'Assia (name_of_person) Osnapruch (name_of_place) trattato dell'Imperio (keyword) Lantgravio di Darmstat (name_of_person) Amnistia nello stati hereditarij (keyword) anni (quantity) Pinorada (name_of_person) Svedesi (keyword) Provincia d'Utrecht (name_of_place) pace (keyword) Spagna (name_of_place) Olanda (name_of_place) Zelanda (name_of_place) Provinzie Basse (name_of_place) Francia (name_of_place)
  • 13. Methods: matching algorithms - handwritten Strategy: compare paragraphs Typed canonicalisation: similar words are grouped into typed categories (Jaro-Winkler distance) Paragraph comparison: Tf-idf vectors, cosine distance Manual evaluation of 2 highest scoring matches Too limited and skewed corpus for now..
  • 14. Results: matchings (handwritten) Munster 24 April 1648: Cologne 19 April 1648: High score, same topic, different news. Different news-sheets
  • 15. Open questions 1. How to effectively evaluate results? The open question of scalable recall and precision 2. How to get a larger corpus (e.g. at least 2 years to study seasonality)? 1) lack of data 2) cost of data preparation 3. How to compare printed and handwritten news? Ongoing work 4. What to focus on? Variations are as interesting as verbatim copies to study the interaction of different medias and types of gazettes..
  • 16. News networks in XVII century Italy Thanks Giovanni Colavizza EPFL, Mario Infelise Ca’ Foscari