Vector with regularly spaced numbers
> 1:10
[1] 1 2 3 4 5 6 7 8 9 10
> seq(1,10)
[1] 1 2 3 4 5 6 7 8 9 10
> seq(1,10,2)
[1] 1 3 5 7 9

• We have used both the “:” operator and the seq command
• Note the last command, where we have used “2” as the step;
this is the “by” argument of the seq command
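
As a quick check of the two points above, the following sketch compares the “:” operator with seq and shows that the third positional argument of seq is “by”. The variable names are only illustrative.

```r
# Both forms produce the integers 1 to 10
a <- 1:10
b <- seq(1, 10)
same_values <- all(a == b)
print(same_values)

# The third positional argument of seq() is 'by'
odd_named <- seq(1, 10, by = 2)
odd_positional <- seq(1, 10, 2)
same_call <- identical(odd_named, odd_positional)
print(odd_named)
print(same_call)
```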
Try some sequence
or seq commands ….
> seq(0,1, length=11)
[1] 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
> seq(4,10,by=0.5)
[1] 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 8.5 9.0 9.5 10.0
> seq(4,10,0.5)
[1] 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 8.5 9.0 9.5 10.0
> seq(2,8,0.3)
[1] 2.0 2.3 2.6 2.9 3.2 3.5 3.8 4.1 4.4 4.7 5.0 5.3 5.6 5.9 6.2 6.5 6.8 7.1 7.4 7.7
[21] 8.0
> seq.int(2,8,0.3)
[1] 2.0 2.3 2.6 2.9 3.2 3.5 3.8 4.1 4.4 4.7 5.0 5.3 5.6 5.9 6.2 6.5 6.8 7.1 7.4 7.7
[21] 8.0
> seq(2,8,length.out=10)
[1] 2.000000 2.666667 3.333333 4.000000 4.666667 5.333333 6.000000 6.666667 7.333333
[10] 8.000000
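
The commands above use two ways of fixing the spacing: “by” gives the step directly, while “length.out” gives the number of values and lets R work out the step as (to - from) / (length.out - 1). The sketch below checks that relationship; the names from, to, n and step are chosen here for illustration. Note that “length=” in the first command works because R partially matches it to “length.out”.

```r
from <- 2
to <- 8
n <- 10

# Step implied by asking for n equally spaced values
step <- (to - from) / (n - 1)
by_len <- seq(from, to, length.out = n)
steps_seen <- diff(by_len)

print(step)
print(by_len)
print(all.equal(steps_seen, rep(step, n - 1)))

# 'length' is partially matched to 'length.out'
short <- seq(0, 1, length = 11)
print(length(short))
```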
Try more seq commands ….
> seq(1,5,0.3)
[1] 1.0 1.3 1.6 1.9 2.2 2.5 2.8 3.1 3.4 3.7 4.0 4.3 4.6 4.9
> pi:6
[1] 3.141593 4.141593 5.141593
> 6:pi
[1] 6 5 4
> 10:-2
[1] 10 9 8 7 6 5 4 3 2 1 0 -1 -2
> -7:8
[1] -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8

• You can generate a decreasing sequence
• Try generating a sequence of negative numbers
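
A small sketch of the same ideas with seq: a decreasing sequence needs a negative “by”, and a positive one is rejected with an error. It also shows why -7:8 starts at -7: the unary minus is applied before the “:” operator. The variable names are only illustrative.

```r
# Decreasing sequence: ':' and seq() with a negative step agree
down_colon <- 10:-2
down_seq <- seq(10, -2, by = -1)
print(all(down_colon == down_seq))

# A sequence of negative numbers with a step of -3
negatives <- seq(-1, -10, by = -3)
print(negatives)

# Unary minus binds more tightly than ':', so -7:8 means (-7):8
print(length(-7:8))
print(-(7:8))

# A step with the wrong sign is an error; capture the message instead of stopping
wrong_sign <- tryCatch(seq(10, -2, by = 1),
                       error = function(e) conditionMessage(e))
print(wrong_sign)
```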
Think and try …...
Generate a sequence of the following numbers:
0.0 0.2 0.4 0.6 0.8 1.0 1.0 2.0 3.0 4.0 5.0
6.0 7.0 8.0 9.0 10.0 100.0

Hints
• You have to use more than one sequence.
• But how will you include “100”?
Think and try ….. Possible Solution
Generate a sequence of the following numbers:
0.0 0.2 0.4 0.6 0.8 1.0 1.0 2.0 3.0 4.0 5.0
6.0 7.0 8.0 9.0 10.0 100.0

> seq(0, 1, length=6)
[1] 0.0 0.2 0.4 0.6 0.8 1.0
> seq.1<-seq(0, 1, length=6)
> c(seq.1,1:10,100)
[1] 0.0 0.2 0.4 0.6 0.8 1.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0
[14] 8.0 9.0 10.0 100.0
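
The solution prints the integers as 1.0, 2.0 and so on because c() converts all of its pieces to a common type. The sketch below, with illustrative names part1, part2 and combined, shows the types before and after combining.

```r
part1 <- seq(0, 1, length.out = 6)   # fractional values, stored as double
part2 <- 1:10                        # stored as integer
combined <- c(part1, part2, 100)     # everything is converted to double

print(typeof(part1))
print(typeof(part2))
print(typeof(combined))
print(length(combined))
print(combined)
```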
Try replicate or rep command
> rep(1:5,2)
[1] 1 2 3 4 5 1 2 3 4 5
> rep(1:5, length=12)
[1] 1 2 3 4 5 1 2 3 4 5 1 2

> rep(c('one', 'two'), c(6, 3))
[1] "one" "one" "one" "one" "one" "one" "two" "two" "two"

Now enter help(rep) command and try the examples
Try replicate or rep command
> rep(1:4, each = 2)
[1] 1 1 2 2 3 3 4 4
> rep(1:4, c(2,2,2,2))
[1] 1 1 2 2 3 3 4 4
> rep(5:8, c(2,1,2,1))
[1] 5 5 6 7 7 8
> rep(1:4, each = 2, len = 4)
[1] 1 1 2 2

Hope you are enjoying this as we go. Have you noted the
arguments “each” and “len” (short for “length.out”)? Now note the “times”
argument
> rep(1:4, each = 2, times = 3)
[1] 1 1 2 2 3 3 4 4 1 1 2 2 3 3 4 4 1 1 2 2 3 3 4 4
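
When “each” and “times” are given together, rep applies “each” first and then repeats the result “times” times. The sketch below checks this, and also that a vector of counts for “times” can reproduce “each”. The variable names are only illustrative.

```r
x <- 1:4

# 'each' is applied first, then the whole result is repeated 'times' times
r <- rep(x, each = 2, times = 3)
two_steps <- rep(rep(x, each = 2), times = 3)
print(identical(r, two_steps))
print(length(r) == length(x) * 2 * 3)

# A vector for 'times' gives one repeat count per element
per_element <- rep(x, times = c(2, 2, 2, 2))
print(identical(per_element, rep(x, each = 2)))
```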
Try Histogram….
Suppose the top 25 ranked movies made the following gross receipts
for a Week:
29.6 28.2 19.6 13.7 13.0 7.8 3.4 2.0 1.9 1.0 0.7 0.4 0.4 0.3
0.3 0.3 0.3 0.3 0.2 0.2 0.2 0.1 0.1 0.1 0.1 0.1
Scan the data and then draw some histograms.
> x
[1] 29.6 28.2 19.6 13.7 13.0 7.8 3.4 2.0 1.9 1.0 0.7 0.4 0.4 0.3 0.3 0.3
[17] 0.3 0.3 0.2 0.2 0.2 0.1 0.1 0.1 0.1 0.1
> receipts<-x
> hist(receipts)
Now try better histograms ….
Add colour, change colour, add title for the histogram, add title
for x-axis and then y-axis
> hist(receipts, col="red2")
> hist(receipts, col="red4")
> hist(receipts, col="red2",main="Gross Receipts for first 25 ranked movies")
> hist(receipts, col="red2",main="Gross Receipts for first 25 ranked movies",xlab="receipts in a week")
> hist(receipts, col="red2",main="Gross Receipts for first 25 ranked movies",xlab="receipts in a week",ylab="count of movies")
[Figure: the histogram produced by the last command, with colour, main title, and both axis labels set.]
Now try better histograms ….
Now set the range of the x-axis and the y-axis
> hist(receipts, col="red2",main="Gross Receipts for first 25 ranked movies",xlab="receipts in a week",ylab="count of movies",xlim=c(0.1,35),ylim=c(0,25))
Now more about histograms ….
Now try breaks=….
What is “breaks”?
> hist(receipts,breaks=3,col="red2",main="Gross Receipts for first 25 ranked movies",xlab="receipts in a week",ylab="count of movies")

Remember:
A single number given as “breaks” is just a
suggestion to R
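
To see what R actually did with the suggestion, you can ask hist to return its result without drawing (plot = FALSE) and look at the break points it chose. This is only a sketch: the receipts vector is retyped from the data above, and the requested values 3, 7 and 12 are arbitrary.

```r
receipts <- c(29.6, 28.2, 19.6, 13.7, 13.0, 7.8, 3.4, 2.0, 1.9, 1.0, 0.7,
              0.4, 0.4, 0.3, 0.3, 0.3, 0.3, 0.3, 0.2, 0.2, 0.2,
              0.1, 0.1, 0.1, 0.1, 0.1)

requested <- c(3, 7, 12)
cells_used <- integer(length(requested))

for (i in seq_along(requested)) {
  # plot = FALSE returns the histogram object without drawing it
  h <- hist(receipts, breaks = requested[i], plot = FALSE)
  cells_used[i] <- length(h$counts)
  cat("requested", requested[i], "-> break points chosen:", h$breaks, "\n")
}

print(cells_used)
```

The number of cells R uses need not equal the number requested, because R picks round break points that cover the range of the data.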
Now more about breaks ….
“breaks” can also specify the actual break points
in a histogram
> hist(receipts,breaks=c(0,1,2,3,4,5,10,20,max(x)),col="violetred")

Note the break points
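
When the break points are not equally spaced, as here, hist draws densities instead of counts by default, so that the area of each bar reflects the number of observations. The sketch below inspects the histogram object without plotting; the receipts vector is retyped from the data above and max(receipts) stands in for max(x).

```r
receipts <- c(29.6, 28.2, 19.6, 13.7, 13.0, 7.8, 3.4, 2.0, 1.9, 1.0, 0.7,
              0.4, 0.4, 0.3, 0.3, 0.3, 0.3, 0.3, 0.2, 0.2, 0.2,
              0.1, 0.1, 0.1, 0.1, 0.1)

cuts <- c(0, 1, 2, 3, 4, 5, 10, 20, max(receipts))
h <- hist(receipts, breaks = cuts, plot = FALSE)

print(h$breaks)     # exactly the break points supplied
print(h$counts)     # observations falling in each cell
print(h$equidist)   # FALSE: the cells have different widths
print(h$density)    # what is drawn by default for unequal widths
```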
Summary and Fivenum
Suppose yearly CEO compensations are sampled and the
following values are found (in millions).
12 0.4 5 2 50 8 3 1 4 0.25
> sals
[1] 12.00 0.40 5.00 2.00 50.00 8.00 3.00 1.00 4.00 0.25
> mean(sals) # the average
[1] 8.565
> var(sals) # the variance
[1] 225.5145
> sd(sals) # the standard deviation
[1] 15.01714
> median(sals) # the median
[1] 3.5
> summary(sals)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
  0.250   1.250   3.500   8.565   7.250  50.000
> fivenum(sals) # min, lower hinge, Median, upper hinge, max
[1] 0.25 1.00 3.50 8.00 50.00
> quantile(sals)
   0%   25%   50%   75%  100%
 0.25  1.25  3.50  7.25 50.00
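
The output above can be compared side by side. In this sketch the quartiles reported by summary are the same numbers that quantile gives, while fivenum differs in the second and fourth values. By default quantile interpolates between neighbouring data points, which is how it arrives at 1.25 and 7.25. The sals vector is retyped from the data above; the other names are illustrative.

```r
sals <- c(12, 0.4, 5, 2, 50, 8, 3, 1, 4, 0.25)

q <- quantile(sals)
s <- summary(sals)
f <- fivenum(sals)

# summary() has six entries; drop the mean (4th) to compare with quantile()
summary_quartiles <- as.numeric(s)[c(1, 2, 3, 5, 6)]
print(all.equal(summary_quartiles, as.numeric(q)))

# Side-by-side comparison of the two five-number summaries
print(rbind(quantile = as.numeric(q), fivenum = f))
```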
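Important: Difference between Fivenum and Quantile
Lower and Upper Hinge
The sorted data, with the median written in at its position:
0.25 0.4 1 2 3 (3.5) 4 5 8 12 50
Median = 3.5

• The lower hinge is the median of all the data to the left of
the median. Here the median (3.5) is not itself a data point, so the
left half is 0.25 0.4 1 2 3 and the lower hinge is 1.
• The upper hinge is similarly defined: the right half is 4 5 8 12 50
and the upper hinge is 8.
• When the number of observations is odd, the median is a data point,
and R's fivenum counts it in both halves.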
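
The hinges for this sample can be worked out by hand as a check. The sketch below assumes an even number of observations, as here, so the two halves are simply the lower and upper five values. The names lower_half and upper_half are illustrative.

```r
sals <- c(12, 0.4, 5, 2, 50, 8, 3, 1, 4, 0.25)

s <- sort(sals)
n <- length(s)          # 10, an even count

# With an even count the median lies between two data points,
# so each half holds n / 2 values
lower_half <- s[1:(n / 2)]
upper_half <- s[(n / 2 + 1):n]

lower_hinge <- median(lower_half)
upper_hinge <- median(upper_half)

print(lower_half)
print(upper_half)
print(c(lower_hinge, upper_hinge))

# Compare with the 2nd and 4th values returned by fivenum()
print(fivenum(sals)[c(2, 4)])
```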
