SAKAUE Akkuş Tatsuya 2011-12-17
Agenda 1. R 2. 3. 4. 5. R
• Ross Ihaka & Robert Gentleman (R & R)
• Ross Ihaka and Robert Gentleman. R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics, 5(3):299-314, 1996.
• http://biostat.mc.vanderbilt.edu/twiki/pub/Main/JeffreyHorner/JCGSR.pdf
• IBM SPSS Statistics 10 ... • •
Excel SPSS ... • • • Excel •
-Install / Update / Uninstall-
1. Go to http://www.r-project.org/
2. Click "download R"
3. Choose a mirror in JAPAN
4. Choose your OS
• Windows: http://cran.md.tsukuba.ac.jp/bin/windows/base/
  • Download R 2.14.0 for Windows
• Mac OS X: http://cran.md.tsukuba.ac.jp/bin/macosx/
  • R-2.14.0.pkg (latest version)
1. http://www.r-project.org/ 2. 3.
• • • Windows •
• Windows
  • [ Windows XP] →
  • [ Windows 7 ] →
• Mac OS X
  • Delete R from /Applications and R.framework from /Library/Frameworks, or use CleanApp
• Windows
  • START → Program → R → R 2.14.0
• Mac OS X
  • Open R in /Applications
• Windows Mac OS X 1. q() 2. 3. R
• help(sth)
• seekR (http://seekr.jp/)
• R SEEK (http://www.rseek.org/)
• RjpWiki (http://www.okada.jp.org/RWiki/)
• R-Tips (http://cse.naro.affrc.go.jp/takezawa/r-tips/r.html)
• R (http://aoki2.si.gunma-u.ac.jp/R/)
• • • help(sth) !!
Agenda 1. R 2. 3. 4. 5. R
• Type an expression, then press [Enter]
• > 3+5 [Enter]
• > 10-3 [Enter]
• > 2*3 [Enter]
• > 100/20 [Enter]
• > (12 + 34 - 56) * 78 / 90 [Enter]
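The last expression on this slide depends on the parentheses. A minimal sketch (the variable names are only for illustration) comparing it with the same expression without them:

```r
# With parentheses: the sum and difference are evaluated first
with_parens <- (12 + 34 - 56) * 78 / 90
with_parens       # -8.666667

# Without parentheses: * and / bind tighter than + and -
without_parens <- 12 + 34 - 56 * 78 / 90
without_parens    # -2.533333
```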
"I dont know !" by fmgbain http://www.flickr.com/photos/fmgbain/4382010455/
sqrt()
• > sqrt(2)
• > sqrt(144)
• > sqrt(104976)
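One way to check these results is to square them again. The sketch below is only illustrative; it also shows that sqrt(2) is stored as an approximation, so comparing with all.equal() is safer than ==.

```r
sqrt(144)       # 12
sqrt(104976)    # 324

x <- sqrt(2)
x^2 == 2                    # FALSE: floating-point rounding
isTRUE(all.equal(x^2, 2))   # TRUE: equal within a small tolerance
```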
( ) • •q() help(sth) •
"I dont know !" by fmgbain http://www.flickr.com/photos/fmgbain/4382010455/
"hako"
• > hako <- c(1,2,3,4,5)
• > hako
• c(): concatenate/combine
c() and "<-"
• hako <- c(1,2,3,4,5)
• "<-" stands for a left arrow (←): the value on the right is assigned to the name on the left
+ ... • [Enter] • [STOP] • [Esc]
• • Tab
hako 1 5 5
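A short sketch for inspecting what hako holds. Only hako comes from the slides; the calls below are standard base R.

```r
hako <- c(1, 2, 3, 4, 5)

length(hako)   # 5: number of elements
hako[1]        # 1: first element (R counts from 1)
hako[5]        # 5: last element
sum(hako)      # 15
```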
• > sqrt(hako)
• > log(hako)
• sqrt(1), sqrt(2) ... sqrt(5)
• log(1), log(2) ... log(5)
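The point of this slide can be checked directly: applying a function to the whole vector gives the same values as applying it to each element. A minimal sketch (roots, manual and logs are illustrative names):

```r
hako <- c(1, 2, 3, 4, 5)

# One call on the whole vector
roots <- sqrt(hako)

# The same thing written element by element
manual <- c(sqrt(1), sqrt(2), sqrt(3), sqrt(4), sqrt(5))
isTRUE(all.equal(roots, manual))   # TRUE

# log() is the natural logarithm by default
logs <- log(hako)
logs[1]                # 0, because log(1) is 0
log(hako, base = 10)   # common logarithm, if that is what you need
```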
•R • • q(), help(), sqrt(), log(), c() •
Agenda 1. R 2. 3. 4. 5. R
... • • • TOEIC
...
A 180 75
B 170 65
C 165 60
D 175 70
E 190 80
... • Excel • →
• ... • •
• Windows → • "MyDocuments" • Mac OS X → • Linux up to you...
• getwd()
  • > getwd()
• setwd()
  • > setwd("/Users/sakaue/Desktop/")
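A cautious pattern is to remember the current working directory before changing it, so it can be restored afterwards. The sketch below uses tempdir() as a stand-in target; replace it with your own folder.

```r
# Remember where we are
old_dir <- getwd()

# Move to another folder (tempdir() is only a placeholder here)
setwd(tempdir())
getwd()

# Go back to the original folder
setwd(old_dir)
```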
• read.csv()
• CSV
• CSV: Comma-Separated Values
1. demo.csv
   • XLS/XLSX
   • CSV UTF-8
2. > test <- read.csv("demo.csv")
3. > test [Enter]
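If demo.csv is not at hand, the same steps can be tried with a small file written from R. This is a sketch under assumptions: the values are the A to E rows shown earlier, and the column names id, col1 and col2 are hypothetical, since the original headers are not given.

```r
# Write a small CSV into a temporary folder (column names are made up)
csv_path <- file.path(tempdir(), "demo.csv")
writeLines(c("id,col1,col2",
             "A,180,75",
             "B,170,65",
             "C,165,60",
             "D,175,70",
             "E,190,80"),
           csv_path)

# Read it back; the first line becomes the column names
test <- read.csv(csv_path)
test
str(test)    # shows the type of each column

# Remove the temporary file
unlink(csv_path)
```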
CSV ...
• read.delim() • • delim: delimiter
1. demo.xls
2. > test2 <- read.delim("clipboard")
   Mac: read.delim(pipe("pbpaste"))
3. > test2 [Enter]
> table(test2[,1])
• column 1
> mean(test2[,2])
• column 2
> hist(test2[,2])
• column 2
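Columns can also be referred to by name instead of by position. The sketch below uses a made-up data frame (the values and the names group and value are illustrative, not the workshop data) to show that both forms give the same result.

```r
# Made-up stand-in for the pasted data
test2 <- data.frame(group = c("a", "b", "a", "c", "b", "a"),
                    value = c(10, 20, 30, 40, 50, 60))

# Column 1: counts per category, by position and by name
counts <- table(test2[, 1])
counts
table(test2$group)

# Column 2: mean, by position and by name
mean(test2[, 2])     # 35
mean(test2$value)    # 35
```

hist(test2$value) works the same way as hist(test2[,2]).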
• • CSV read.csv() • or
CSV Excel "y2.d175 | Lasershow! Relax!" by B Rosen http://www.flickr.com/photos/rosengrant/4751386872/
Agenda 1. R 2. 3. 4. 5. R
1. 2.
2 Excel SPSS ... orz
> age <- c(18, 23, 14, 19, 21, 29, 22, 21, 23, 19, 20, 20, 26, 18, 14, 6, 8, 16, 23, 20)
> hist(age)
> score <- c(60, 50, 72, 43, 50, 55, 43, 50, 85, 40)
> words <- c(340, 190, 465, 170, 130, 225, 140, 310, 580, 120)
> plot(score, words)
> nns <- c(350, 285, 315, 340, 210, 185, 120, 740, 425, 155)
> ns <- c(365, 570, 645, 540, 645, 665, 880, 550, 410, 585)
> boxplot(nns, ns, names=c("NNS", "NS"))
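The numbers behind a box plot can be printed as well. A minimal sketch using the same nns and ns vectors; boxplot.stats() is the base R helper that computes the values boxplot() draws, including the points plotted as outliers.

```r
nns <- c(350, 285, 315, 340, 210, 185, 120, 740, 425, 155)
ns  <- c(365, 570, 645, 540, 645, 665, 880, 550, 410, 585)

# The thick line in each box is the median
median(nns)   # 300
median(ns)    # 577.5

# Minimum, quartiles, mean and maximum
summary(nns)
summary(ns)

# Points drawn separately as outliers
nns_out <- boxplot.stats(nns)$out
ns_out  <- boxplot.stats(ns)$out
nns_out
ns_out
```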
1. 2. 3. twitter, FB
Excel "y2.d175 | Lasershow! Relax!" by B Rosen http://www.flickr.com/photos/rosengrant/4751386872/
Agenda 1. R 2. 3. 4. 5. R
t-test
t-test
• H0 • H1 • H0 (t, χ2, F ) •
t-test
• R: t.test()
t-test
> like <- c(6,10,6,10,7,8,7,9,10,4)
> dislike <- c(3,5,6,4,4,8,4,5,4,7)
> t.test(like,dislike,var.equal=TRUE)
t-test
> t.test(like,dislike,var.equal=TRUE)

	Two Sample t-test

data:  like and dislike
t = 3.3041, df = 18, p-value = 0.003946
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 0.9831754 4.4168246
sample estimates:
mean of x mean of y
      7.7       5.0
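The printed output can also be picked apart in code. The sketch below stores the result of t.test() in a variable (res and the other new names are illustrative) and recomputes t by hand from the pooled variance, which is what var.equal=TRUE implies.

```r
like    <- c(6, 10, 6, 10, 7, 8, 7, 9, 10, 4)
dislike <- c(3, 5, 6, 4, 4, 8, 4, 5, 4, 7)

res <- t.test(like, dislike, var.equal = TRUE)

res$statistic   # t value
res$parameter   # degrees of freedom
res$p.value     # p-value
res$conf.int    # 95 percent confidence interval

# The same t value, computed step by step
n1 <- length(like)
n2 <- length(dislike)
pooled_var <- ((n1 - 1) * var(like) + (n2 - 1) * var(dislike)) / (n1 + n2 - 2)
t_manual <- (mean(like) - mean(dislike)) / sqrt(pooled_var * (1 / n1 + 1 / n2))
t_manual
```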
• • • A B •
: "however" 109 347 8 493 [ ] However, .... [ ] ..., however, .... [ ] ..., however.
> freq <- c(109,347,8)
> chisq.test(freq,correct=FALSE)

	Chi-squared test for given probabilities

data:  freq
X-squared = 391.7371, df = 2, p-value < 2.2e-16

# 2
# http://homepage2.nifty.com/nandemoarchive/toukei_kiso/t_F_chi.htm
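With a single vector of counts, chisq.test() compares the observed frequencies against equal expected frequencies. A minimal sketch recomputing the statistic by hand (res, expected, x2 and p are illustrative names):

```r
freq <- c(109, 347, 8)

res <- chisq.test(freq)
res$expected    # the same expected count for all three categories

# The same statistic, computed step by step
expected <- rep(sum(freq) / length(freq), length(freq))
x2 <- sum((freq - expected)^2 / expected)
x2

# p-value from the chi-squared distribution with k - 1 degrees of freedom
p <- pchisq(x2, df = length(freq) - 1, lower.tail = FALSE)
p
```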
• • t • • ! • ...
One more thing...
Package
Package • • base 1,000 • package • ex. RMeCab
RMeCab
RMeCab • •R MeCab • R
• RMeCabText():
• RMeCabFreq():
• Ngram(): N-gram
• collocate():
2,940 1,785 3,780
twitter: @sakaue e-mail: tsakaue<AT>hiroshima-u.ac.jp

