Stat310            Sequences of rvs


                            Hadley Wickham
Wednesday, 17 March 2010
Major’s day
                   2:30-4:30pm Today
                   Oshman Engineering Design Kitchen


                   Come along and talk to me (or Rudy
                   Guerra) if you’re interested in becoming a
                   stat major


Assessment

                   Test model answers online tonight
                   (hopefully)
                   Usual help session tonight 4-5pm.




               1. Sequences
               2. Limits
               3. Chebyshev’s theorem
               4. The law of large numbers
               5. The central limit theorem



Sequences

                   1 variable: X
                   2 variables: X, Y
                   ...
                   n variables: X1, X2, X3, ..., Xn




Sequences
                   Xi ~ Normal(μi, σi)
                   Xi ~ Normal(μ, σi)
                   Xi ~ Normal(μi, σ)
                   Xi ~ Normal(μ, σ)
                   Almost always assume that the Xi’s are
                   independent. In the last case they are
                   also identically distributed.


iid = independent &
                identically distributed


Your turn

                   Xi are iid N(0, 2).
                   What is E(X30)? What is Var(X2001)?
                   What is Cor(X10, X11)? Cor(X1, X1000)?
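
A quick simulation is one way to check answers to the exercise above. This is a minimal sketch, assuming the 2 in N(0, 2) is the variance (so the standard deviation is √2); the helper names `draw_sequences` and `correlation` are made up for illustration and are not part of the course material.

```python
import math
import random
import statistics

def draw_sequences(n_reps, n_vars, mean=0.0, variance=2.0, seed=310):
    """Draw n_reps independent realisations of (X1, ..., Xn_vars), all iid Normal."""
    rng = random.Random(seed)
    sd = math.sqrt(variance)  # assumption: the second parameter of N(0, 2) is the variance
    return [[rng.gauss(mean, sd) for _ in range(n_vars)] for _ in range(n_reps)]

def correlation(xs, ys):
    """Sample (Pearson) correlation of two equal-length lists."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

reps = draw_sequences(n_reps=20_000, n_vars=11)
x10 = [r[9] for r in reps]   # realisations of X10
x11 = [r[10] for r in reps]  # realisations of X11

est_mean = statistics.fmean(x10)
est_var = statistics.variance(x10)
est_cor = correlation(x10, x11)
print(f"estimated E(X10)        = {est_mean:.3f}")
print(f"estimated Var(X10)      = {est_var:.3f}")
print(f"estimated Cor(X10, X11) = {est_cor:.3f}")
```

Because the Xi are identically distributed, the estimates for X10 apply equally to X30 or X2001, and independence means the index gap between two variables makes no difference to their correlation.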




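                   $E\left(\sum_{i=1}^{n} X_i\right) = \sum_{i=1}^{n} E(X_i)$

                   $Var\left(\sum_{i=1}^{n} a_i X_i\right) = \sum_{i=1}^{n} a_i^2 Var(X_i)$
                   If what is true?

                   $E\left(\prod_{i=1}^{n} X_i\right) = \prod_{i=1}^{n} E(X_i)$
                   If what is true?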
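
The variance rule above can be checked numerically. The sketch below assumes the Xi are independent (one condition under which the rule holds) and uses made-up weights and standard deviations purely for illustration.

```python
import random
import statistics

rng = random.Random(2010)
a = [1.0, -2.0, 0.5]   # illustrative weights a_i (hypothetical values)
sds = [1.0, 2.0, 3.0]  # illustrative standard deviations of X_1, X_2, X_3

def weighted_sum():
    # Each X_i is drawn independently of the others.
    return sum(ai * rng.gauss(0.0, sd) for ai, sd in zip(a, sds))

draws = [weighted_sum() for _ in range(50_000)]

simulated = statistics.variance(draws)
# Right-hand side of the rule: sum of a_i^2 * Var(X_i).
theoretical = sum(ai ** 2 * sd ** 2 for ai, sd in zip(a, sds))

print(f"simulated   Var(sum a_i X_i) = {simulated:.2f}")
print(f"theoretical sum a_i^2 Var(X_i) = {theoretical:.2f}")
```

If the Xi were correlated, covariance terms would appear and the two numbers would no longer agree.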
Limits
                   Typically will define some function of n
                   random variables, e.g. $\bar{X}_n$
                   What happens to $\bar{X}_n$ when n → ∞?
                   Why? Because often it will converge, and
                   we can use this to approximate results for
                   any large n.



New notation

                   If xn → 0, and n is big, we can say xn ≈ 0.
                   If Xn → Z, Z ~ N(0, 1), and n is big,
                   we can say Xn $\dot{\sim}$ N(0, 1).

                   Read as approximately distributed.
                   Other ways to write it



Chebyshev
                   (No limit, but a good starting point)

                   $P(|X - \mu| < K\sigma) \ge 1 - \frac{1}{K^2}$

                   $P(|X - \mu| > K\sigma) \le \frac{1}{K^2}$

                   For K > 0
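
Chebyshev's bound can be checked empirically on a distribution that is far from normal. The sketch below is an illustration under assumptions not in the slides: it uses an exponential distribution with rate 1 (mean 1, standard deviation 1) and three arbitrary values of K.

```python
import random

rng = random.Random(17)
n_draws = 100_000

# Exponential with rate 1: mean 1, standard deviation 1, strongly skewed.
mu, sigma = 1.0, 1.0
draws = [rng.expovariate(1.0) for _ in range(n_draws)]

results = {}
for k in (1.5, 2.0, 3.0):
    # Observed fraction of draws more than k standard deviations from the mean.
    tail = sum(abs(x - mu) > k * sigma for x in draws) / n_draws
    bound = 1.0 / k ** 2  # Chebyshev's upper bound on that probability
    results[k] = (tail, bound)
    print(f"K = {k}: observed {tail:.4f}, Chebyshev bound {bound:.4f}")
```

The observed tail fractions sit well below the bound, which is typical: the inequality has to hold for every distribution with a finite variance, so it is rarely tight for any particular one.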
Your turn

                   How can you put this in words?

                   $P(|X - \mu| > K\sigma) \le \frac{1}{K^2}$


The probability of being more than K standard deviations
away from the mean is at most one over K squared.
(For K > 0)

[Plot: 1/K² against K, for K from 0 to 10]
(For K > 1)

[Plot: 1/K² against K, for K from 1 to 10; vertical axis from 0 to 1]
Your turn

                   How does this compare to the normal
                   distribution? Compare the probability of
                   being less than 1, 2 and 3 standard
                   deviations away from the mean given by
                   Chebyshev and what we know about the
                   normal.
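
The comparison asked for above can be computed directly. This is a small sketch using only Python's standard library; the function names are made up for illustration. It relies on the fact that for a normal variable P(|X − µ| < Kσ) = erf(K/√2).

```python
import math

def chebyshev_lower_bound(k):
    """Chebyshev's guarantee for P(|X - mu| < k*sigma), valid for any finite-variance distribution."""
    return max(0.0, 1.0 - 1.0 / k ** 2)

def normal_within(k):
    """Exact P(|X - mu| < k*sigma) when X is normally distributed."""
    return math.erf(k / math.sqrt(2.0))

comparison = {k: (chebyshev_lower_bound(k), normal_within(k)) for k in (1, 2, 3)}
for k, (cheby, norm) in comparison.items():
    print(f"K = {k}: Chebyshev at least {cheby:.3f}, normal exactly {norm:.3f}")
```

For K = 1 Chebyshev guarantees nothing at all, while the normal probabilities are the familiar 68%, 95% and 99.7% values. The gap is the price of making no distributional assumption.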



[Plot: value against x, for x up to 10, with one line each for "cheby" and "norm"; vertical axis from 0 to 1]
LLN
                   Law of large numbers
                   X1, X2, ..., Xn iid.

                   $\bar{X}_n = \frac{1}{n} \sum_{i=1}^{n} X_i$


                   There are five ways to write the result.


What does it mean?
                   As we collect more and more data, the
                   sample mean gets closer and closer to
                   the true mean.
                   Not that surprising!
                   But note that we didn’t make any
                   assumptions about the shape of the
                   distribution (only that the Xi are iid
                   with a finite mean).
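
The behaviour described above is easy to see in a simulation. The sketch below is an illustration, not part of the original slides: it assumes rolls of a fair six-sided die (true mean 3.5), a distribution that looks nothing like a normal.

```python
import random
import statistics

rng = random.Random(4)
true_mean = 3.5  # mean of a fair six-sided die
rolls = [rng.randint(1, 6) for _ in range(100_000)]

sample_means = {}
for n in (10, 100, 1_000, 10_000, 100_000):
    # Sample mean of the first n rolls.
    sample_means[n] = statistics.fmean(rolls[:n])
    error = abs(sample_means[n] - true_mean)
    print(f"n = {n:>7}: sample mean = {sample_means[n]:.4f}, error = {error:.4f}")
```

The error does not shrink at every single step, but for large n the sample mean settles close to 3.5.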



CLT

                   Central limit theorem.
                   The distribution of a mean is approximately
                   normal when n gets big.
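
A simulation gives a feel for the statement above. This is a minimal sketch under assumptions not in the slides: each mean is taken over n = 30 draws from Uniform(0, 1), which has mean 1/2 and variance 1/12, and the means are standardised so they can be compared with N(0, 1).

```python
import math
import random
import statistics

rng = random.Random(5)
n = 30            # sample size behind each mean (illustrative choice)
n_means = 10_000  # number of simulated sample means

mu = 0.5                    # mean of Uniform(0, 1)
se = math.sqrt(1 / 12 / n)  # standard deviation of the mean of n such draws

# Standardise each simulated sample mean.
z_scores = [
    (statistics.fmean([rng.random() for _ in range(n)]) - mu) / se
    for _ in range(n_means)
]

# For an exact N(0, 1), about 95% of values fall within +/- 1.96.
within = sum(abs(z) < 1.96 for z in z_scores) / n_means
print(f"mean of z = {statistics.fmean(z_scores):.3f}")
print(f"sd of z   = {statistics.pstdev(z_scores):.3f}")
print(f"fraction within 1.96 = {within:.3f}")
```

Even though a single uniform draw is flat rather than bell-shaped, the standardised means behave very much like standard normal draws.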




Approximation


                   This implies that if n is big then ...




Reading


                Section 4.1
                Focus on the general ideas and the
                definitions



