Quantitative 
Methods 
for 
Lawyers 
R Boot Camp - Bonus 
Additional Resources 
Data Cleaning 
professor daniel martin katz computationallegalstudies.com @ computational
Often you will not 
know in advance 
how to solve your 
problem
Great news is that 
there is amount 
+∞ 
online content 
to help you
http://rseek.org/
When you do not know how to do 
something try your query here 
http://stackoverflow.com/
or iteratively query google until 
you find some documentation
also feel free to poke around 
on Bloggers
Some Additional 
Resources to 
Learn
https://www.datacamp.com/courses/introduction-to-r
http://www.ats.ucla.edu/stat/r/seminars/intro.htm
http://www3.nd.edu/~mclark19/learn/Introduction_to_R.pdf
Data Cleaning / 
Preprocessing is typically a 
very important thing that 
thwarts many inquiries
http://www.nytimes.com/2014/08/18/technology/for-big-data-scientists-hurdle-to-insights-is-janitor-work.html
“Cleaning up data to 
the point where you 
can work with it is a 
huge amount of 
work. 
If you’re trying to 
reconcile a lot of 
sources of data that 
you don’t control 
like in this flight 
search example, it 
can take 80% of 
your time.” 
http://jvns.ca/blog/2014/06/19/machine-learning-isnt-kaggle-competitions/
http://blog.revolutionanalytics.com/2014/08/data-cleaning-is-a-critical-part-of-the-data-science-process.html
http://101.datascience.community/2013/03/22/r-commands-for-cleaning-data/
https://www.coursera.org/course/getdata
http://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo- 
Introduction_to_data_cleaning_with_R.pdf

Quantitative Methods for Lawyers - R Boot Camp Bonus Module - Professor Daniel Martin Katz