Introduction into R for the European Historical Population Sample summerschool, Cluj-Napoca, Romania, 2015. Aimed at an audience of historians with few quantitative skills.
3x half-day lectures and practicals on introductory facets of R, among others: installing R and RStudio, reading in and writing out data, data cleaning, descriptive statistics, and data visualization (including visual analysis). Courtesy of the European Historical Sample Population Network and the Babeş-Bolyai University (Cluj-Napoca, Romania).
Introduction into R for historians (part 1: introduction)
1. Introduction into R
Richard L. Zijdeman
28 May 2015
2. Outline
1 Quantitative research methods
2 Statistical Software
3 Introducing R vocabulary
4 Getting help
5 Installing R and RStudio
3. Quantitative research methods
4. Why
To answer descriptive and explanatory questions about populations
5. Workflow: P-T-E
problem (research question)
theory (hypothesis)
empirical test, with feedback loops between T-E and P-T-E
6. Research Questions
descriptive (to what extent ...)
comparative (comparing two entities)
trend (comparison over time)
explanatory (focus on the mechanism at hand)
7. Theory
deductive reasoning
explanans
general mechanism
condition
explanandum (hypothesis)
8. Empirical test
sample vs. population
random vs. stratified samples
testing technique, e.g.:
t-test, correlation, regression
Software required for faster analysis
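In R, each of these testing techniques is a single function call. A minimal sketch with invented data (the variable names and values here are purely illustrative, not from the course):

```r
# Illustrative data only: heights (cm) of two hypothetical samples
set.seed(42)
group.a <- rnorm(30, mean = 170, sd = 8)
group.b <- rnorm(30, mean = 175, sd = 8)

t.test(group.a, group.b)       # t-test: do the two group means differ?
cor(group.a, group.b)          # correlation between the two variables
fit <- lm(group.b ~ group.a)   # simple linear regression
summary(fit)                   # coefficients, R-squared, p-values
```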
9. Statistical Software
10. The dangers of analysing with spreadsheets (e.g. MS Excel)
tempting to input and clean data in the same sheet
difficult to track cleaning rules
defaults mess up your data (e.g. 01200 -> 1200)
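The leading-zero problem can bite in R too: by default `read.csv()` guesses column types and turns "01200" into the number 1200. A hedged sketch (the column names are made up) showing how declaring the column type avoids this:

```r
csv <- "postcode,n\n01200,5\n04321,7"

# Default import: the postcode column is guessed to be numeric,
# so the leading zero is silently lost
df1 <- read.csv(text = csv)
df1$postcode            # 1200 4321

# Declaring the column as character keeps "01200" intact
df2 <- read.csv(text = csv, colClasses = c("character", "integer"))
df2$postcode            # "01200" "04321"
```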
11. Why use syntax (scripting)?
Efficiency (really)
Quality (error checking)
Replicability
Communication
12. R
R is open source, which is both good and bad:
anybody can contribute (check, improve, create code)
free of charge
but: R depends on collective action
you cannot 'demand' support
sprawl of packages
13. RStudio
a 'browser' for R
provides easy access to:
scripts
data
plots
manuals
14. Introducing R vocabulary
15. R script
* series of commands to manipulate data
* always save your script, NEVER change your data
original data + script = reproducible research
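A minimal script following this rule might look as follows (the data and file names are illustrative): the original data are only ever read, each cleaning step is recorded as code, and the cleaned result goes to a new file.

```r
# A tiny stand-in for an original data file
# (in practice: raw <- read.csv("mydata.csv"))
raw <- read.csv(text = "name,age\nAna,34\nIon,NA\nMaria,51")

# Cleaning step, recorded in the script: drop cases with missing age
clean <- raw[!is.na(raw$age), ]

# Results go to a NEW file; the original data stay untouched
out <- file.path(tempdir(), "mydata_clean.csv")
write.csv(clean, out, row.names = FALSE)
```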
16. R Session
* contains scripts, data, functions
* can be saved as a 'workspace image'
* prefer not to:
+ sessions are usually cluttered
+ only useful if running the script takes a long time
17. Assignment
* 'attach' values to an object (e.g. a variable)
x <- 5
y <- 4
z <- x*y
print(z)
## [1] 20
18. Assignment II
Try to imagine the potential of assignment
x <- c(4, 3, 2, 1, 0, 27, 34, 35)
# c() combines (concatenates) values into a vector
y <- -1
z <- x*y
print(z)
## [1] -4 -3 -2 -1 0 -27 -34 -35
19. Data.frame
basically a table
contains columns (variables)
contains rows (cases)
a "flat table" in Kees' terminology
my.df <- data.frame(x, z)
str(my.df) # show the STRucture
## 'data.frame': 8 obs. of 2 variables:
## $ x: num 4 3 2 1 0 27 34 35
## $ z: num -4 -3 -2 -1 0 -27 -34 -35
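Once data sit in a data.frame, columns and rows are easy to address. A short sketch, re-using the vectors from the slides above:

```r
x <- c(4, 3, 2, 1, 0, 27, 34, 35)
z <- x * -1
my.df <- data.frame(x, z)

my.df$x                 # a single column (variable) by name
my.df[1, ]              # a single row (case) by number
my.df[my.df$x > 20, ]   # all cases where x exceeds 20
```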
20. Packages and libraries
base R (core product)
additional packages
CRAN repository
spread through 'mirrors'
choose a local, but active mirror
GitHub
packages not on CRAN
development versions of CRAN packages
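Installing and loading a package are two separate steps. A hedged sketch (ggplot2 is just a well-known example; the install lines are commented out because they need a network connection):

```r
# Installing happens once per machine:
# install.packages("ggplot2")                    # from a CRAN mirror
# devtools::install_github("tidyverse/ggplot2")  # development version from GitHub
                                                 # (assumes the devtools package)

# Loading happens once per session; 'stats' ships with R itself,
# so this line always works:
library(stats)
```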
21. Getting help
22. Built-in help: "?"
?[function] / ?[package]
e.g. "?plot" or "?graphics"
check the index for user guides and vignettes
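A few concrete ways to reach the built-in documentation from the console, as a sketch:

```r
?plot                      # help page for the plot() function
help("plot")               # the same, written out
help.search("regression")  # full-text search across installed packages
vignette()                 # list longer user guides (vignettes)
```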
23. CRAN website
Manuals
R FAQ
R Journal
24. Online communities
Stack Overflow
an instance of Stack Exchange
reputation-based Q&A
specific mailing lists for packages, e.g.:
ggplot2
R-sig-mixed-models
25. Asking a question, getting an answer
Search the web: others must have had this problem too
If you raise a question:
be polite
be concise
give a short background
provide a replicable example
describe your efforts so far
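A 'replicable example' means code plus data that anyone can paste into a fresh R session. A minimal sketch of what such a question could contain (the data are invented for illustration):

```r
# Small, self-contained data that reproduces the problem
# (values use commas as decimal separators):
df <- data.frame(year = c(1850, 1860, 1870),
                 pop  = c("1,2", "3,4", "5,6"),
                 stringsAsFactors = FALSE)

# What I tried -- mean() gives NA because the values are text, not numbers:
mean(as.numeric(df$pop))

# What I expected: the mean of 1.2, 3.4 and 5.6
sessionInfo()  # also report your R version and platform
```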
26. Installing R and RStudio
27. Download R
Instructions via http://www.r-project.org
Choose a CRAN mirror
http://cran.r-project.org/mirrors.html
close by, but active too!
Romania's mirror hasn't gone (yet!)
Click on 'Download R for Windows'
Follow the usual installation procedure
Double click on R
You should now have a working session!
Close the session; do not save the workspace image
28. RStudio
RStudio is found at http://www.rstudio.com
Download the version for your OS (e.g. Windows)
http://www.rstudio.com/products/rstudio/download/
Install by double clicking on the downloaded file
Start RStudio by double clicking on the icon
You do not need to start R before starting RStudio