16 Sequences

•

0 likes•575 views

This document contains notes from a statistics course. It discusses sequences of random variables, limits, Chebyshev's theorem, the law of large numbers, and the central limit theorem. The notes provide definitions and examples of key concepts like independent and identically distributed random variables, convergence in probability, and how the sample mean approximates the population mean as the sample size increases. Students are prompted with practice problems applying the concepts.

Technology News & Politics

Stat310 Sequences of rvs

Hadley Wickham
Wednesday, 17 March 2010

Major’s day
2:30-4:30pm Today
Oshman Engineering Design Kitchen

Come along and talk to me (or Rudy
Guerra) if you’re interested in becoming a
stat major

Wednesday, 17 March 2010

Assessment

Test model answers online tonight
(hopefully)
Usual help session tonight 4-5pm.

Wednesday, 17 March 2010

1. Sequences
2. Limits
3. Chebyshev’s theorem
4. The law of large numbers
5. The central limit theorem

Wednesday, 17 March 2010

Sequences

1 variable: X
2 variables: X, Y
...
n variables: X1, X2, X3, ..., Xn

Wednesday, 17 March 2010

Sequences
Xi ~ Normal(μi, σi)
Xi ~ Normal(μ, σi)
Xi ~ Normal(μi, σ)
Xi ~ Normal(μ, σ)
Almost always assume that the Xi’s are
independent. In the last case they are
also identically distributed.

Wednesday, 17 March 2010

iid = independent &
identically distributed

Wednesday, 17 March 2010

Your turn

Xi are iid N(0, 2).
What is E(X30)? What is Var(X2001)?
What is Cor(X10, X11)? Cor(X1, X1000)?

Wednesday, 17 March 2010

n
n

E( Xi ) = E(Xi )
i i
n
n

V ar( ai Xi ) = 2
ai V ar(Xi )
i i
If what is true?
n
n

E( Xi ) = E(Xi )
i i If what is true?
Wednesday, 17 March 2010

Limits
Typically will deﬁne some function of n
¯
random variables, e.g. Xn
¯
What happens to Xn when n → ∞?
Why? Because often it will converge, and
we can use this to approximate results for
any large n.

Wednesday, 17 March 2010

New notation

If xn → 0, and n is big, we can say xn ≈ 0.
If Xn → Z, Z ~ N(0, 1), and n is big,
we can say Xn ~ . N(0,1).

Read as approximately distributed.
Other ways to write it

Wednesday, 17 March 2010

N
go

o
od
lim art
Chebyshev

it ing
st

-b p
ut oin
a t
1
P (|X − µ| Kσ) ≥ 1 − 2
K
1
P (|X − µ| Kσ) ≤ 2
K
For K 0
Wednesday, 17 March 2010

Your turn

How can you put this in words?
1
P (|X − µ| Kσ) ≤ 2
K

Wednesday, 17 March 2010

The probability of being more
than K standard deviations
80 away from the mean is less
than one over K squared.
60
(For K 0)
1 K2

40

20

0 2 4 6 8 10
K
Wednesday, 17 March 2010

(For K 1)
1.0

0.8

0.6
1 K2

0.4

0.2

0.0

2 4 6 8 10
K
Wednesday, 17 March 2010

Your turn

How does this compare to the normal
distribution? Compare the probability of
being less than 1, 2 and 3 standard
deviations away from the mean given by
Chebychev and what we know about the
normal.

Wednesday, 17 March 2010

1.0

0.8

0.6

variable
value

cheby
norm
0.4

0.2

0.0

2 4 6 8 10
x
Wednesday, 17 March 2010

LLN
Law of large numbers
X1, X2, ..., Xn iid.

n

¯
Xn = Xi
i

There are ﬁve ways to write the result.

Wednesday, 17 March 2010

What does it mean?
As we collect more and more data, the
sample mean gets closer and closer to
the true mean.
Not that surprising!
But note that we didn’t make any
assumptions about the distributions

Wednesday, 17 March 2010

CLT

Central limit theorem.
The distribution of a mean is normal when
gets big.

Wednesday, 17 March 2010

Approximation

This implies that if n is big then ...

Wednesday, 17 March 2010

Reading

Section 4.1
Focus on the general ideas and the
deﬁntions

Wednesday, 17 March 2010

Overview of how/why to reshape data in R from "wide" (spreadsheet-like) to "long" (database-like) and back. Focuses on Hadley Wickham's reshape2 package and uses state population data from the 2010 U.S. Census. Also demonstrates use of dcast() to replace table(), etc. to generate crosstabs from a sample market research consumer survey. Presented at the April 2011 meeting of the Greater Boston useR Group.

Machine learning in R

apolol92

4 R Tutorial DPLYR Apply Function

Sakthi Dasans

Data manipulation with dplyr

Romain Francois

Data Manipulation Using R (& dplyr)

Ram Narasimhan

Introducing natural language processing(NLP) with r

Vivian S. Zhang

27 developmentHadley Wickham

Viewers also liked

20 date-timesHadley Wickham

04 WrapupHadley Wickham

Correlations, Trends, and Outliers in ggplot2

Chris Rucker

24 modellingHadley Wickham

21 spamHadley Wickham

03 ConditionalHadley Wickham

Model Visualisation (with ggplot2)Hadley Wickham

Graphical inferenceHadley Wickham

03 ModellingHadley Wickham

R workshop iii -- 3 hours to learn ggplot2 series

Vivian S. Zhang

23 data-structuresHadley Wickham

R packagesHadley Wickham

02 DdplyHadley Wickham

01 IntroHadley Wickham

Reshaping Data in R

Jeffrey Breen

Machine learning in R

apolol92

4 R Tutorial DPLYR Apply Function

Sakthi Dasans

Data manipulation with dplyr

Romain Francois

Data Manipulation Using R (& dplyr)

Ram Narasimhan

Introducing natural language processing(NLP) with r

Vivian S. Zhang

Viewers also liked (20)

20 date-times

04 Wrapup

Correlations, Trends, and Outliers in ggplot2

24 modelling

21 spam

03 Conditional

Model Visualisation (with ggplot2)

Graphical inference

03 Modelling

R workshop iii -- 3 hours to learn ggplot2 series

23 data-structures

R packages

02 Ddply

01 Intro

Reshaping Data in R

Machine learning in R

4 R Tutorial DPLYR Apply Function

Data manipulation with dplyr

Data Manipulation Using R (& dplyr)

Introducing natural language processing(NLP) with r

Recently uploaded

National Security Agency - NSA mobile device best practices

Quotidiano Piemontese

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Aggregage

UiPath Test Automation using UiPath Test Suite series, part 5

DianaGray10

Full-RAG: A modern architecture for hyper-personalization

Zilliz

Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.

Securing your Kubernetes cluster_ a step-by-step guide to success !

KatiaHIMEUR1

Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster. However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks. In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024

Neo4j

20 Comprehensive Checklist of Designing and Developing a Website

Pixlogix Infotech

Dive into the world of Website Designing and Developing with Pixlogix! Looking to create a stunning online presence? Look no further! Our comprehensive checklist covers everything you need to know to craft a website that stands out. From user-friendly design to seamless functionality, we've got you covered. Don't miss out on this invaluable resource! Check out our checklist now at Pixlogix and start your journey towards a captivating online presence today.

Mind map of terminologies used in context of Generative AI

Kumud Singh

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...

Neo4j

Dr. Sean Tan, Head of Data Science, Changi Airport Group Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.

RESUME BUILDER APPLICATION Project for students

KAMESHS29

GridMate - End to end testing is a critical piece to ensure quality and avoid...

ThomasParaiso2

How to Get CNIC Information System with Paksim Ga.pptx

danishmna97

Building RAG with self-deployed Milvus vector database and Snowpark Container...

Zilliz

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Neo4j

Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.

Monitoring Java Application Security with JDK Tools and JFR Events

Ana-Maria Mihalceanu

Artificial Intelligence for XMLDevelopment

Octavian Nadolu

In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject. We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup. Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved. The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring. The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise. By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.

Large Language Model (LLM) and it’s Geospatial Applications

Rohit Gautam

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

Neo4j

Leonard Jayamohan, Partner & Generative AI Lead, Deloitte This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.

DevOps and Testing slides at DASA Connect

Kari Kakkonen

Essentials of Automations: The Art of Triggers and Actions in FME

Safe Software

In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation. We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios. Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!

Recently uploaded (20)

National Security Agency - NSA mobile device best practices

Generative AI Deep Dive: Advancing from Proof of Concept to Production

UiPath Test Automation using UiPath Test Suite series, part 5

Full-RAG: A modern architecture for hyper-personalization

Securing your Kubernetes cluster_ a step-by-step guide to success !

GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024

20 Comprehensive Checklist of Designing and Developing a Website

Mind map of terminologies used in context of Generative AI

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...

RESUME BUILDER APPLICATION Project for students

GridMate - End to end testing is a critical piece to ensure quality and avoid...

How to Get CNIC Information System with Paksim Ga.pptx

Building RAG with self-deployed Milvus vector database and Snowpark Container...

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Monitoring Java Application Security with JDK Tools and JFR Events

Artificial Intelligence for XMLDevelopment

Large Language Model (LLM) and it’s Geospatial Applications

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

DevOps and Testing slides at DASA Connect

Essentials of Automations: The Art of Triggers and Actions in FME

16 Sequences

1. Stat310 Sequences of rvs Hadley Wickham Wednesday, 17 March 2010

2. Major’s day 2:30-4:30pm Today Oshman Engineering Design Kitchen Come along and talk to me (or Rudy Guerra) if you’re interested in becoming a stat major Wednesday, 17 March 2010

3. Assessment Test model answers online tonight (hopefully) Usual help session tonight 4-5pm. Wednesday, 17 March 2010

4. 1. Sequences 2. Limits 3. Chebyshev’s theorem 4. The law of large numbers 5. The central limit theorem Wednesday, 17 March 2010

5. Sequences 1 variable: X 2 variables: X, Y ... n variables: X1, X2, X3, ..., Xn Wednesday, 17 March 2010

6. Sequences Xi ~ Normal(μi, σi) Xi ~ Normal(μ, σi) Xi ~ Normal(μi, σ) Xi ~ Normal(μ, σ) Almost always assume that the Xi’s are independent. In the last case they are also identically distributed. Wednesday, 17 March 2010

7. iid = independent & identically distributed Wednesday, 17 March 2010

8. Your turn Xi are iid N(0, 2). What is E(X30)? What is Var(X2001)? What is Cor(X10, X11)? Cor(X1, X1000)? Wednesday, 17 March 2010

9. n n E( Xi ) = E(Xi ) i i n n V ar( ai Xi ) = 2 ai V ar(Xi ) i i If what is true? n n E( Xi ) = E(Xi ) i i If what is true? Wednesday, 17 March 2010

10. Limits Typically will deﬁne some function of n ¯ random variables, e.g. Xn ¯ What happens to Xn when n → ∞? Why? Because often it will converge, and we can use this to approximate results for any large n. Wednesday, 17 March 2010

11. New notation If xn → 0, and n is big, we can say xn ≈ 0. If Xn → Z, Z ~ N(0, 1), and n is big, we can say Xn ~ . N(0,1). Read as approximately distributed. Other ways to write it Wednesday, 17 March 2010

12. N go o od lim art Chebyshev it ing st -b p ut oin a t 1 P (|X − µ| Kσ) ≥ 1 − 2 K 1 P (|X − µ| Kσ) ≤ 2 K For K 0 Wednesday, 17 March 2010

13. Your turn How can you put this in words? 1 P (|X − µ| Kσ) ≤ 2 K Wednesday, 17 March 2010

14. The probability of being more than K standard deviations 80 away from the mean is less than one over K squared. 60 (For K 0) 1 K2 40 20 0 2 4 6 8 10 K Wednesday, 17 March 2010

15. (For K 1) 1.0 0.8 0.6 1 K2 0.4 0.2 0.0 2 4 6 8 10 K Wednesday, 17 March 2010

16. Your turn How does this compare to the normal distribution? Compare the probability of being less than 1, 2 and 3 standard deviations away from the mean given by Chebychev and what we know about the normal. Wednesday, 17 March 2010

17. 1.0 0.8 0.6 variable value cheby norm 0.4 0.2 0.0 2 4 6 8 10 x Wednesday, 17 March 2010

18. LLN Law of large numbers X1, X2, ..., Xn iid. n ¯ Xn = Xi i There are ﬁve ways to write the result. Wednesday, 17 March 2010

19. What does it mean? As we collect more and more data, the sample mean gets closer and closer to the true mean. Not that surprising! But note that we didn’t make any assumptions about the distributions Wednesday, 17 March 2010

20. CLT Central limit theorem. The distribution of a mean is normal when gets big. Wednesday, 17 March 2010

21. Approximation This implies that if n is big then ... Wednesday, 17 March 2010

22. Reading Section 4.1 Focus on the general ideas and the deﬁntions Wednesday, 17 March 2010

16 Sequences

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (20)

More from Hadley Wickham

More from Hadley Wickham (20)

Recently uploaded

Recently uploaded (20)

16 Sequences