SlideShare a Scribd company logo
eResearch Africa Conference
Library Carpentry Workshop
5 May 2017
Kayleigh Roos & Erika Mias
Digital Curation Officers, UCT Libraries
Isak van der Walt
Senior IT Consultant, UP DLS Strategic Innovation
● Introduction
● Jargon busting
TEA
● Foundations
● Wrap-up & Feedback
LUNCH
● OpenRefine Intro & Basics
TEA
● OpenRefine Lesson
Intro to Data
Overview
Library Carpentry Workshop
5 May 2017
Introduction to Data
Library Carpentry Workshop
5 May 2017
● Stickies
Problematic terms/concepts/phrases Resolved terms/concepts/phrases
● Helpers
○ Instructors around the room who are not presenting
● computers are stupid, can frustrate, and as you all have different machines it can be tricky
to resolve problems.
○ Be patient, step aside, take a gulp of air, and put you red sticky up!
Intro to Data
Help!
Library Carpentry Workshop
5 May 2017
Help!
All good
● Video: James Baker
https://youtu.be/40GX3AwgREg
Intro to Data
What is Library Carpentry?
Library Carpentry Workshop
5 May 2017
Intro to Data
Still confused?
Library Carpentry Workshop
5 May 2017
Jargon Busting
Demystifying commonly used terms
in data science & software
Library Carpentry Workshop
5 May 2017
What the data?
Jargon Busting
Library Carpentry Workshop
5 May 2017
source: XKCD https://xkcd.com/1146/
List of technical terms a.k.a Jargon
Jargon Busting
Library Carpentry Workshop
5 May 2017
● Conference program e.g.:
○ Research Data Management
○ Open Stack
○ Data Repository
○ etc
● Library Carpentry
○ OpenRefine
○ Github
○ Regular expressions
○ Python
○ etc.
● Other
○ “
○
Breakaway groups
Jargon Busting
Library Carpentry Workshop
5 May 2017
Group discussion - working through the stickies
Jargon Busting
Library Carpentry Workshop
5 May 2017
CASRAI: dictionary of research administration information
Jargon Busting
Library Carpentry Workshop
5 May 2017
Foundations
Best practice & generic skills for working with and
understanding research software
Library Carpentry Workshop
5 May 2017
Foundations
Library Carpentry Workshop
5 May 2017
THE COMPUTER IS STUPID
Foundations
Library Carpentry Workshop
5 May 2017
THE COMPUTER IS STUPID
● computer only does what you tell it to. If it throws up an error it is often
not your fault, rather in most cases the computer has failed to interpret
what you mean because it can only work with what it knows
● if you find an error message frustrating, it isn’t the computer’s fault that it
is giving you an archaic and incomprehensible error message, you might
just need to re-think the way in which you asked it to do something
Foundations
Library Carpentry Workshop
5 May 2017
CARPENTRY: the skill to know which tool to use and which tools to learn when need be
● One of the fundamental theories of a carpenter is that they ‘know which tools to use’ to solve or work on
particular tasks
General consensus around the fact that so many Library processes could be automated
through the implementation of some simple programming skills
● Repetitive tasks + automation = time & effort saving
Automation
Foundations
Library Carpentry Workshop
5 May 2017
Automation
● Borrow, Borrow, and Borrow again;
● The correct language to learn is the one that
works in your local context
● Knowing (even a little) code helps you
evaluate projects that use code
● Automate to make the time to do something
else
Foundations
Library Carpentry Workshop
5 May 2017
Shortcuts
Keyboard shortcuts are your friend
● ctrl+s for save; ctrl+c for copy; ctrl+x for cut; ctrl+v for paste (cmd on Mac)
● alt+tab (windows) cmd+tab (mac) for moving between programmes
● ctrl+shift+tab (Windows) or ctrl+tab (Mac) for moving between browser tabs
… Your favourites?
Foundations
Library Carpentry Workshop
5 May 2017
Open formats
Plain text (or open) formats are your friend
● Why?
○ All computers can process them
○ Interoperability
● Types of open file formats?
○ .txt
○ .csv
○ .html
○ .xml
○ .jpg
File naming
Naming files sensible things is good for you and your computer!
• Three criteria to assist with naming files:
○ Organisation
○ Context
○ Consistency
• Elements to consider when naming files:
○ version numbers
○ creation / publication date
○ creator’s name / group name
○ content description
○ project number
• Always consider scalability when naming files
○ e.g. 001 vs 01
• Don’t
○ punctuation, or capital letters
○ use special characters or spaces
• Do
○ replace full-stops with underscores
○ replace spaces with dashes
○ keep to YYYY-MM-DD date format
○ keep file names relevant and as short as possible
Library Carpentry Workshop
5 May 2017
Foundations
http://theawkwardyeti.com/comic/misc/
File versioning
credit: PHD Comics
“Final”.doc http://phdcomics.com/comics.php?f=1531
Always record changes to your data files, even if it seems
unnecessary!
● Don’t use the word “final” - instead, number or date
versions
● Avoid using labels - eg. ‘draft’, ‘test’, ‘final’, ‘rev’, ‘corrected’,
etc
● Indicate major version changes with:
○ YYYY-MM-DD_Title_Author_V1
○ YYYY-MM-DD_Title_Author_V2
● Indicate minor version changes with:
○ YYYY-MM-DD_Title_Author_V1-1
○ YYYY-MM-DD_Title_Author_V1-2
Library Carpentry Workshop
5 May 2017
Foundations
Wrap-up
Remaining issues?
Feedback
Library Carpentry Workshop
5 May 2017

More Related Content

Similar to Data Intro for Librarians: Data Carpentry Workshop eRA2017

A Writing Group Strategy for Scientists
A Writing Group Strategy for ScientistsA Writing Group Strategy for Scientists
A Writing Group Strategy for Scientists
gizemk
 
Ace the Tech Interviews - www.hiredintech.com
Ace the Tech Interviews - www.hiredintech.comAce the Tech Interviews - www.hiredintech.com
Ace the Tech Interviews - www.hiredintech.com
Anton Dimitrov
 
Year4_DC_Project
Year4_DC_ProjectYear4_DC_Project
Year4_DC_Project
Miss Gutjahr
 
Technical Writing Overview: WTD Nigeria
Technical Writing Overview: WTD NigeriaTechnical Writing Overview: WTD Nigeria
Technical Writing Overview: WTD Nigeria
Margaret Fero
 
UPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer TidwellUPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer Tidwellnikrao
 
UPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer TidwellUPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer Tidwellguestf59d1c4
 
Never show a design you haven't tested
Never show a design you haven't testedNever show a design you haven't tested
Never show a design you haven't tested
Ida Aalen
 
Keith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst PracticesKeith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst Practices
Jack Molisani
 
Power of the Swarm - Agile Serbia Conference 2017
Power of the Swarm - Agile Serbia Conference 2017Power of the Swarm - Agile Serbia Conference 2017
Power of the Swarm - Agile Serbia Conference 2017
Petri Heiramo
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
National Information Standards Organization (NISO)
 
Dojo Masters - Collaborative Writing in Distributed Teams
Dojo Masters - Collaborative Writing in Distributed TeamsDojo Masters - Collaborative Writing in Distributed Teams
Dojo Masters - Collaborative Writing in Distributed Teams
John Hedtke
 
Creating Accessible Documents
Creating Accessible DocumentsCreating Accessible Documents
Creating Accessible Documents
Ana Thompson
 
Interface Design for Elearning - Tips and Tricks
Interface Design for Elearning - Tips and TricksInterface Design for Elearning - Tips and Tricks
Interface Design for Elearning - Tips and Tricks
Julie Dirksen
 
Presentation skills
Presentation skillsPresentation skills
Presentation skills
Dennis Cleofe
 
It's XP, Stupid
It's XP, StupidIt's XP, Stupid
It's XP, Stupid
Mike Harris
 
The essentials of the IT industry or What I wish I was taught about at Univer...
The essentials of the IT industry or What I wish I was taught about at Univer...The essentials of the IT industry or What I wish I was taught about at Univer...
The essentials of the IT industry or What I wish I was taught about at Univer...
Equal Experts
 
USC 100: Summer Presentation
USC 100: Summer PresentationUSC 100: Summer Presentation
USC 100: Summer Presentation
Richard Winters
 
Create your library
Create your libraryCreate your library
Create your library
Laurence Chen
 

Similar to Data Intro for Librarians: Data Carpentry Workshop eRA2017 (20)

A Writing Group Strategy for Scientists
A Writing Group Strategy for ScientistsA Writing Group Strategy for Scientists
A Writing Group Strategy for Scientists
 
Ace the Tech Interviews - www.hiredintech.com
Ace the Tech Interviews - www.hiredintech.comAce the Tech Interviews - www.hiredintech.com
Ace the Tech Interviews - www.hiredintech.com
 
Year4_DC_Project
Year4_DC_ProjectYear4_DC_Project
Year4_DC_Project
 
Technical Writing Overview: WTD Nigeria
Technical Writing Overview: WTD NigeriaTechnical Writing Overview: WTD Nigeria
Technical Writing Overview: WTD Nigeria
 
UPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer TidwellUPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer Tidwell
 
UPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer TidwellUPA2007 Designing Interfaces Jenifer Tidwell
UPA2007 Designing Interfaces Jenifer Tidwell
 
Never show a design you haven't tested
Never show a design you haven't testedNever show a design you haven't tested
Never show a design you haven't tested
 
Keith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst PracticesKeith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst Practices
 
Power of the Swarm - Agile Serbia Conference 2017
Power of the Swarm - Agile Serbia Conference 2017Power of the Swarm - Agile Serbia Conference 2017
Power of the Swarm - Agile Serbia Conference 2017
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Dojo Masters - Collaborative Writing in Distributed Teams
Dojo Masters - Collaborative Writing in Distributed TeamsDojo Masters - Collaborative Writing in Distributed Teams
Dojo Masters - Collaborative Writing in Distributed Teams
 
Creating Accessible Documents
Creating Accessible DocumentsCreating Accessible Documents
Creating Accessible Documents
 
Interface Design for Elearning - Tips and Tricks
Interface Design for Elearning - Tips and TricksInterface Design for Elearning - Tips and Tricks
Interface Design for Elearning - Tips and Tricks
 
Week 3
Week 3Week 3
Week 3
 
Presentation skills
Presentation skillsPresentation skills
Presentation skills
 
It's XP, Stupid
It's XP, StupidIt's XP, Stupid
It's XP, Stupid
 
The essentials of the IT industry or What I wish I was taught about at Univer...
The essentials of the IT industry or What I wish I was taught about at Univer...The essentials of the IT industry or What I wish I was taught about at Univer...
The essentials of the IT industry or What I wish I was taught about at Univer...
 
USC 100: Summer Presentation
USC 100: Summer PresentationUSC 100: Summer Presentation
USC 100: Summer Presentation
 
Create your library
Create your libraryCreate your library
Create your library
 
2015 ed psy 510 #3
2015 ed psy 510 #32015 ed psy 510 #3
2015 ed psy 510 #3
 

Recently uploaded

Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
Georgi Kodinov
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Shahin Sheidaei
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
AMB-Review
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
timtebeek1
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
abdulrafaychaudhry
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 

Recently uploaded (20)

Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 

Data Intro for Librarians: Data Carpentry Workshop eRA2017

  • 1. eResearch Africa Conference Library Carpentry Workshop 5 May 2017 Kayleigh Roos & Erika Mias Digital Curation Officers, UCT Libraries Isak van der Walt Senior IT Consultant, UP DLS Strategic Innovation
  • 2. ● Introduction ● Jargon busting TEA ● Foundations ● Wrap-up & Feedback LUNCH ● OpenRefine Intro & Basics TEA ● OpenRefine Lesson Intro to Data Overview Library Carpentry Workshop 5 May 2017
  • 3. Introduction to Data Library Carpentry Workshop 5 May 2017
  • 4. ● Stickies Problematic terms/concepts/phrases Resolved terms/concepts/phrases ● Helpers ○ Instructors around the room who are not presenting ● computers are stupid, can frustrate, and as you all have different machines it can be tricky to resolve problems. ○ Be patient, step aside, take a gulp of air, and put you red sticky up! Intro to Data Help! Library Carpentry Workshop 5 May 2017 Help! All good
  • 5. ● Video: James Baker https://youtu.be/40GX3AwgREg Intro to Data What is Library Carpentry? Library Carpentry Workshop 5 May 2017
  • 6. Intro to Data Still confused? Library Carpentry Workshop 5 May 2017
  • 7. Jargon Busting Demystifying commonly used terms in data science & software Library Carpentry Workshop 5 May 2017
  • 8. What the data? Jargon Busting Library Carpentry Workshop 5 May 2017 source: XKCD https://xkcd.com/1146/
  • 9. List of technical terms a.k.a Jargon Jargon Busting Library Carpentry Workshop 5 May 2017 ● Conference program e.g.: ○ Research Data Management ○ Open Stack ○ Data Repository ○ etc ● Library Carpentry ○ OpenRefine ○ Github ○ Regular expressions ○ Python ○ etc. ● Other ○ “ ○
  • 10. Breakaway groups Jargon Busting Library Carpentry Workshop 5 May 2017
  • 11. Group discussion - working through the stickies Jargon Busting Library Carpentry Workshop 5 May 2017
  • 12. CASRAI: dictionary of research administration information Jargon Busting Library Carpentry Workshop 5 May 2017
  • 13. Foundations Best practice & generic skills for working with and understanding research software Library Carpentry Workshop 5 May 2017
  • 14. Foundations Library Carpentry Workshop 5 May 2017 THE COMPUTER IS STUPID
  • 15. Foundations Library Carpentry Workshop 5 May 2017 THE COMPUTER IS STUPID ● computer only does what you tell it to. If it throws up an error it is often not your fault, rather in most cases the computer has failed to interpret what you mean because it can only work with what it knows ● if you find an error message frustrating, it isn’t the computer’s fault that it is giving you an archaic and incomprehensible error message, you might just need to re-think the way in which you asked it to do something
  • 16. Foundations Library Carpentry Workshop 5 May 2017 CARPENTRY: the skill to know which tool to use and which tools to learn when need be ● One of the fundamental theories of a carpenter is that they ‘know which tools to use’ to solve or work on particular tasks General consensus around the fact that so many Library processes could be automated through the implementation of some simple programming skills ● Repetitive tasks + automation = time & effort saving Automation
  • 17. Foundations Library Carpentry Workshop 5 May 2017 Automation ● Borrow, Borrow, and Borrow again; ● The correct language to learn is the one that works in your local context ● Knowing (even a little) code helps you evaluate projects that use code ● Automate to make the time to do something else
  • 18. Foundations Library Carpentry Workshop 5 May 2017 Shortcuts Keyboard shortcuts are your friend ● ctrl+s for save; ctrl+c for copy; ctrl+x for cut; ctrl+v for paste (cmd on Mac) ● alt+tab (windows) cmd+tab (mac) for moving between programmes ● ctrl+shift+tab (Windows) or ctrl+tab (Mac) for moving between browser tabs … Your favourites?
  • 19. Foundations Library Carpentry Workshop 5 May 2017 Open formats Plain text (or open) formats are your friend ● Why? ○ All computers can process them ○ Interoperability ● Types of open file formats? ○ .txt ○ .csv ○ .html ○ .xml ○ .jpg
  • 20. File naming Naming files sensible things is good for you and your computer! • Three criteria to assist with naming files: ○ Organisation ○ Context ○ Consistency • Elements to consider when naming files: ○ version numbers ○ creation / publication date ○ creator’s name / group name ○ content description ○ project number • Always consider scalability when naming files ○ e.g. 001 vs 01 • Don’t ○ punctuation, or capital letters ○ use special characters or spaces • Do ○ replace full-stops with underscores ○ replace spaces with dashes ○ keep to YYYY-MM-DD date format ○ keep file names relevant and as short as possible Library Carpentry Workshop 5 May 2017 Foundations http://theawkwardyeti.com/comic/misc/
  • 21. File versioning credit: PHD Comics “Final”.doc http://phdcomics.com/comics.php?f=1531 Always record changes to your data files, even if it seems unnecessary! ● Don’t use the word “final” - instead, number or date versions ● Avoid using labels - eg. ‘draft’, ‘test’, ‘final’, ‘rev’, ‘corrected’, etc ● Indicate major version changes with: ○ YYYY-MM-DD_Title_Author_V1 ○ YYYY-MM-DD_Title_Author_V2 ● Indicate minor version changes with: ○ YYYY-MM-DD_Title_Author_V1-1 ○ YYYY-MM-DD_Title_Author_V1-2 Library Carpentry Workshop 5 May 2017 Foundations