Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
From unstructured data to structured journalism
1. From unstructured data to
structured journalism
Giuseppe Futia
Nexa Center for Internet and Society, Politecnico di Torino
(DAUIN)
April 12, 2016
Master in Giornalismo "Giorgio Bocca" di Torino
2. Nexa Center for Internet &
Society at Politecnico di Torino
Website:
http://nexa.polito.it/
11. Using "machine learning," technologists
at news outlets around the world are
helping newsrooms eliminate extra
time-consuming tasks and giving
humans more time to do what they do
best: reporting the news (Poynter.org)
17. Panama papers leak
• 11.5 million of documents
– 4.8 million of mails
– 4 million of database entries
– 2 million of PDFs
– 1 million of images
– 320.000 text documents
• 100 news organisations and 400 journalists
18. Panama papers processing
• Sort and organise the files
• Index these files
• Bring out all of the metadata
• Investigate data from the big data and
analytical perspective
19. Panama papers result
• The final database: 30 per cent of the original
data size
• Bring out entities: first names and second
names
• Analytics to find how these names refer to the
documents