RSA 2019, Toronto
Preconference day
March 16, 2019
11AM-1PM
Program
 11:00-11:05 – Introduction to the session and presenters
 PRESENTATION OF PROJECTS
 11:05-11:20 – Jodi: Mapping Titian, Mapping Paintings
 11:20-11:35 – Catherine: Mapping Sculpture
 PRESENTATION OF TOOLS
 11:35-12:05 – Angela: OpenRefine, TimelineJS
 12:05-12:35 – Catherine: Palladio, CARTO
 Hands-on
OpenRefine
 Cleaning up messy data from a
spreadsheet
 Spelling errors
 Uniform data
 Removing whitespace
 Splitting columns
 Enriching data from external sources
 Etc.
You won’t analyse your data cell by cell, but in groups and sets. The application is therefore suitable for very large data sets.
OpenRefine
 Apart from cleaning data, you can also use OpenRefine for other purposes
 Word counts in sets
 Combine sheets
 Enriching reconciled data with OpenRefine: import data from Wikidata or VIAF
OpenRefine
 Free, open source software
 Works best with Google Chrome (less well with Safari and Internet Explorer)
 Written in Java. Requires a Java runtime (JRE)
 It is an Interactive Data Transformation tool (IDT), which lets you change a large data set in one operation. It is similar to a spreadsheet, but has more functionality.
 Works as a desktop application. It does not store your data. Save it! It can be used in several browser tabs at the same time.
 The .exe file opens a terminal window in which a small local server runs; the application itself opens in your web browser. The terminal window needs to remain open. Runs offline through this local server.
OpenRefine
 Choose a project file and upload it.
 Rename the project (save it later; OpenRefine does not save or store automatically!!)
 Use UTF-8 encoding
 Configure your data: You will be shown a preview of your data. In the lower blue field, make sure “Parse data as” is set to “CSV / TSV / separator-based files”. Where it says character encoding, click in the blank field next to it and select UTF-8 from the pop-up window of encodings. Make sure the first row with your column headers is recognized as headers (boldfaced) and not as data. If it is not recognized automatically, tick the checkbox for “Parse next ‘1’ line(s) as column headers”. Since our exercise file is a CSV, activate the radio button “commas (CSV)” as the separator.
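The import settings above can be mirrored outside OpenRefine. The following is a minimal Python sketch, not part of the workshop material: it decodes a file as UTF-8, parses it as comma-separated text and treats the first line as column headers. The sample data and the names SAMPLE_BYTES and load_rows are invented for illustration.

```python
import csv
import io

# Invented sample standing in for the exercise file, as raw bytes on disk.
SAMPLE_BYTES = "Name,City\nArtist A,Forlì\nArtist B,Firenze\n".encode("utf-8")


def load_rows(data):
    """Decode as UTF-8, parse as CSV, and use the first line as headers."""
    text = data.decode("utf-8")
    reader = csv.DictReader(io.StringIO(text), delimiter=",")
    return list(reader)


rows = load_rows(SAMPLE_BYTES)
print(rows[0]["City"])
```

Decoding with the wrong encoding would typically show up in accented characters such as the "ì" in the sample, which is why the encoding is set before anything else.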
OpenRefine – basic cleanup
 Text facet -> Cluster
 Get rid of whitespace: «Edit cells» -> «Common transforms» -> «Trim leading and trailing whitespace» / «Collapse consecutive whitespace»
 Divide columns: «Edit column» -> «Split into several columns…»
 Reorder columns
 Cluster: «Edit cells» -> «Cluster and edit…» (a cluster can only be merged as a whole; you cannot select individual values within it)
 Replace: «Edit cells» -> «Replace»
 Undo/redo: step-by-step history in the menu
 Deleting rows: Text facet -> choose the values to eliminate and star the matching rows -> under «All»: «Facet» -> «Facet by star» -> select «true» -> under «All»: «Edit rows» -> «Remove all matching rows»
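To make the whitespace and clustering steps more concrete, here is a small Python sketch of the underlying idea. It is a simplified illustration and not OpenRefine's actual implementation: the fingerprint function only loosely follows the key-collision approach (trim, lowercase, drop punctuation, sort the unique words), and all function names and sample values are invented.

```python
import re
from collections import defaultdict


def trim(value):
    """Remove leading and trailing whitespace."""
    return value.strip()


def collapse_whitespace(value):
    """Replace every run of whitespace with a single space."""
    return re.sub(r"\s+", " ", value)


def fingerprint(value):
    """Simplified clustering key: same words in any order give the same key."""
    cleaned = re.sub(r"[^\w\s]", "", value.strip().lower())
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def cluster(values):
    """Group values by key and keep only groups with differing spellings."""
    groups = defaultdict(list)
    for v in values:
        groups[fingerprint(v)].append(v)
    return [g for g in groups.values() if len(set(g)) > 1]


names = ["Botticelli, Sandro", "sandro botticelli", "Sandro  Botticelli ", "Fra Angelico"]
clusters = cluster(names)
print(clusters)
```

Each group returned by cluster is a candidate for merging into one spelling, which corresponds to accepting a whole cluster in the «Cluster and edit…» dialog.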
OpenRefine - transform
 Exchange values: «Edit cells» -> «Transform…» -> language GREL -> enter an expression that transforms the value
 Replace: value.replace('xx', 'x')
 Add characters to a column: "prefix" + value
 Cleaning up a date to show only the year: datePart(value, 'year') (the cell must hold a date, e.g. after value.toDate())
 GREL: General Refine Expression Language on GitHub https://github.com/OpenRefine/OpenRefine/wiki/General-Refine-Expression-Language
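For readers who know Python better than GREL, the three transforms above can be sketched as follows. This is an analogy only, not GREL itself; the function names and sample values are invented, and the date is assumed to be an ISO-formatted string (YYYY-MM-DD).

```python
from datetime import date


def replace_text(value, old, new):
    """Counterpart of the replace transform: swap one substring for another."""
    return value.replace(old, new)


def add_prefix(value, prefix):
    """Counterpart of prefix + value: put fixed characters in front of a cell."""
    return prefix + value


def year_of(value):
    """Counterpart of the year extraction: parse an ISO date, keep the year."""
    return date.fromisoformat(value).year


print(replace_text("Firenzze", "zz", "z"))
print(add_prefix("Q42", "http://www.wikidata.org/entity/"))
print(year_of("1510-05-17"))
```

In OpenRefine the expression is applied to every cell of the column at once; in this sketch the same effect would come from calling the function on each value in a list.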
OpenRefine – example from Wikipedia – Italian artists
 Download table from Wikipedia
 You want to separate names and years
 Add column based on this column
 Edit cells -> replace (to change the opening bracket into a colon, to be used later as separator)
 Edit column – split into several columns (use the colon as separator)
 Replace ) with nothing (empty string)
 Join two columns: value + ", " + cells["mycell"].value
 Separate the parts of a person's name: Edit column – add column based on this column – value.split(" ")[1]
 ○ [1] = last name / [0] = first name
 Put last name and first name together: value + ", " + cells["Firstname"].value
 Another option: Split cells: choose ‘Edit cells’, ‘Split multi-valued cells’, entering ‘|’ as the value separator.
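The steps of this example can be sketched in Python to show what each one does to a single cell. This is an illustration under assumptions: the cell is assumed to look like "First Last (years)", and the function names and sample values are invented. As on the slide, index 0 is taken as the first name and index 1 as the last name, so names with more than two parts would need extra handling.

```python
def split_name_and_years(cell):
    """Turn the opening bracket into a colon, split there, drop the ')'."""
    with_separator = cell.replace(" (", ":").replace("(", ":")
    name, _, years = with_separator.partition(":")
    return name.strip(), years.replace(")", "").strip()


def last_first(name):
    """Build 'Last, First' from 'First Last'; leave single names untouched."""
    parts = name.split(" ")
    if len(parts) < 2:
        return name
    return parts[1] + ", " + parts[0]


name, years = split_name_and_years("Sandro Botticelli (1445-1510)")
print(name, "|", years)
print(last_first(name))
```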
OpenRefine for Data enrichment
(using Linked Open Data)
 Fetch URLs using Refine
 Construct URL queries to retrieve information from a simple web API
 Using query services like:
 Wikidata
 Google Maps API
 VIAF (Virtual International Authority File)
 etc.
Retrieving data from Wikidata
 You need a column Wikidata_uri
 Create a column Wikidata_id: Edit column -> Add column based on this column -> to extract the ID, enter the expression replace(value, "http://www.wikidata.org/entity/", "")
 On the Wikidata_id column: Edit column -> Add column by fetching URLs -> to query birth dates (property P569), enter the expression "https://tools.wmflabs.org/openrefine-wikidata/en/fetch_values?item=" + value + "&prop=P569" -> name the column «date_of_birth_Wikidata». The result is in JSON.
 Clean the data: Edit cells -> Transform -> enter the expression forEach(value.parseJson().values, v, v).join(";")
 Cleaning up a date to show only the year: datePart(value, 'year')
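The string handling in these steps can be sketched in Python without contacting any server. This is an illustration under assumptions: the endpoint is the one quoted above and may not be reachable today, the response is assumed to be a JSON object with a "values" list (as the parseJson().values expression implies), and the item ID, the sample response and all function names are invented for the example.

```python
import json
from urllib.parse import urlencode

ENTITY_PREFIX = "http://www.wikidata.org/entity/"
FETCH_ENDPOINT = "https://tools.wmflabs.org/openrefine-wikidata/en/fetch_values"


def extract_id(uri):
    """Strip the entity prefix so only the Wikidata ID remains."""
    return uri.replace(ENTITY_PREFIX, "")


def build_query_url(item_id, prop):
    """Assemble the fetch URL for one item and one property."""
    return FETCH_ENDPOINT + "?" + urlencode({"item": item_id, "prop": prop})


def join_values(response_text):
    """Parse the JSON response and join its values with semicolons."""
    values = json.loads(response_text)["values"]
    return ";".join(str(v) for v in values)


item_id = extract_id("http://www.wikidata.org/entity/Q42")  # sample ID only
url = build_query_url(item_id, "P569")
sample_response = '{"values": ["1952-03-11T00:00:00Z"]}'  # invented response
print(url)
print(join_values(sample_response))
```

No request is sent here; in OpenRefine the «Add column by fetching URLs» step performs the download for every row.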
Retrieving data from Wikidata
 Reconcile (how simple is this!!)
 Choose the source – Wikidata (include other columns too if needed)
 Start reconciling – records will be linked to Wikidata automatically (the remainder has to be matched manually)
 Use values as identifiers
OpenRefine - export
 At the end: export your data set! (OpenRefine does not change your original data set)
 Single column export -> facet -> choose facet -> export CSV
 Full sheet export -> comma-separated values
 It is also possible to export only parts of your sheet.
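What a partial export amounts to can be sketched in Python: keep only some columns, optionally only the rows that match a condition, and write the result as CSV. This is an illustration, not OpenRefine's exporter; the function name export_csv, its keep parameter and the sample rows are invented.

```python
import csv
import io


def export_csv(rows, columns, keep=None):
    """Write the chosen columns of the (optionally filtered) rows as CSV text."""
    selected = [r for r in rows if keep is None or keep(r)]
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(selected)
    return buffer.getvalue()


sample_rows = [
    {"Name": "Artist A", "City": "Forlì"},
    {"Name": "Artist B", "City": "Firenze"},
]

# Single column, all rows.
print(export_csv(sample_rows, ["Name"]))
# All columns, only rows matching a condition (comparable to an active facet).
print(export_csv(sample_rows, ["Name", "City"], keep=lambda r: r["City"] == "Firenze"))
```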
OpenRefine tutorials
 http://openrefine.org/
 https://programminghistorian.org/en/lessons/cleaning-data-with-openrefine
 https://github.com/miriamposner/get-started-with-openrefine/blob/master/get-started-with-openrefine.md
 https://github.com/OpenRefine/OpenRefine/wiki/Documentation-For-Users
 Retrieving data from Wikidata or VIAF: https://medium.com/the-bytegeist-blog/enriching-reconciled-data-with-openrefine-89b885dcadbb
 There are many more!!
Timelines (selection)
 TimelineJS (Northwestern University) https://news.northwestern.edu/stories/2012/03/knight-lab-digital-timelines/ (with examples and spreadsheet)
 Neatline – for Omeka http://docs.neatline.org/creating-records.html
 Google Timeline https://www.google.com/maps/timeline?pb
 Office Timelines (for Excel or PowerPoint) https://templates.office.com/en-us/Timelines?page=1
TimelineJS
With Google Chrome and Google Spreadsheets
 Advantages
 Easy to use for a chronological visualization
 Incorporates maps and images from the web
 Can be embedded in websites and PowerPoint presentations
 Disadvantages
 Limited interactivity
 Only uses images published on the web, not images from your own collection
TimelineJS
With Google Chrome
 https://timeline.knightlab.com/
 Botticelli spreadsheet: https://docs.google.com/spreadsheets/d/1BAg-2_XZM-Oap1cwQoftBcYjrJYBjXOSNOqdXBwQWyY/edit#gid=0
 Botticelli timeline (embedded link to website or presentation)
Thank you!
Dr. Angela Dressen
Villa I Tatti, The Harvard University Center for Italian Renaissance Studies / Florence, Italy
adressen@itatti.harvard.edu
Discipline Representative for Digital
Humanities at the Renaissance Society of
America (RSA)


Editor's Notes

  1. Cleaning up your own accumulated data or data gathered from the net. Works with an algorithm.
  2. Wikidata provides an endpoint for querying data as a URL. Once you know the property you would like to retrieve, the objective is to use OpenRefine to build a query string and retrieve the data you want from that endpoint.