This document provides an overview of probability topics covered in Chapter 3 of an introductory statistics textbook. It defines key probability terms and concepts such as sample space, events, mutually exclusive and independent events. It presents methods for calculating probabilities using formulas like the multiplication rule and addition rule. It also introduces tools for organizing and visualizing probability problems like contingency tables, tree diagrams and Venn diagrams. Examples are provided to illustrate how to apply these probability rules and representations.
* Satoshi Hara and Kohei Hayashi. Making Tree Ensembles Interpretable: A Bayesian Model Selection Approach. AISTATS'18 (to appear).
arXiv ver.: https://arxiv.org/abs/1606.09066#
* GitHub
https://github.com/sato9hara/defragTrees
* Satoshi Hara and Kohei Hayashi. Making Tree Ensembles Interpretable: A Bayesian Model Selection Approach. AISTATS'18 (to appear).
arXiv ver.: https://arxiv.org/abs/1606.09066#
* GitHub
https://github.com/sato9hara/defragTrees
3 PROBABILITY TOPICSFigure 3.1 Meteor showers are rare, .docxtamicawaysmith
3 | PROBABILITY TOPICS
Figure 3.1 Meteor showers are rare, but the probability of them occurring can be calculated. (credit: Navicore/flickr)
Introduction
Chapter Objectives
By the end of this chapter, the student should be able to:
• Understand and use the terminology of probability.
• Determine whether two events are mutually exclusive and whether two events are independent.
• Calculate probabilities using the Addition Rules and Multiplication Rules.
• Construct and interpret Contingency Tables.
• Construct and interpret Venn Diagrams.
• Construct and interpret Tree Diagrams.
It is often necessary to "guess" about the outcome of an event in order to make a decision. Politicians study polls to guess
their likelihood of winning an election. Teachers choose a particular course of study based on what they think students can
comprehend. Doctors choose the treatments needed for various diseases based on their assessment of likely results. You
may have visited a casino where people play games chosen because of the belief that the likelihood of winning is good. You
may have chosen your course of study based on the probable availability of jobs.
You have, more than likely, used probability. In fact, you probably have an intuitive sense of probability. Probability deals
with the chance of an event occurring. Whenever you weigh the odds of whether or not to do your homework or to study
for an exam, you are using probability. In this chapter, you will learn how to solve probability problems using a systematic
approach.
Your instructor will survey your class. Count the number of students in the class today.
• Raise your hand if you have any change in your pocket or purse. Record the number of raised hands.
CHAPTER 3 | PROBABILITY TOPICS 163
• Raise your hand if you rode a bus within the past month. Record the number of raised hands.
• Raise your hand if you answered "yes" to BOTH of the first two questions. Record the number of raised hands.
Use the class data as estimates of the following probabilities. P(change) means the probability that a randomly chosen
person in your class has change in his/her pocket or purse. P(bus) means the probability that a randomly chosen person
in your class rode a bus within the last month and so on. Discuss your answers.
• Find P(change).
• Find P(bus).
• Find P(change AND bus). Find the probability that a randomly chosen student in your class has change in his/her
pocket or purse and rode a bus within the last month.
• Find P(change|bus). Find the probability that a randomly chosen student has change given that he or she rode a
bus within the last month. Count all the students that rode a bus. From the group of students who rode a bus,
count those who have change. The probability is equal to those who have change and rode a bus divided by those
who rode a bus.
3.1 | Terminology
Probability is a measure that is associated with how certain we are of outcomes of a particular experiment or activity.
An e ...
1 Probability Please read sections 3.1 – 3.3 in your .docxaryan532920
1
Probability
Please read sections 3.1 – 3.3 in your textbook
Def: An experiment is a process by which observations are generated.
Def: A variable is a quantity that is observed in the experiment.
Def: The sample space (S) for an experiment is the set of all possible outcomes.
Def: An event E is a subset of a sample space. It provides the collection of outcomes
that correspond to some classification.
Example:
Note: A sample space does not have to be finite.
Example: Pick any positive integer. The sample space is countably infinite.
A discrete sample space is one with a finite number of elements, { }1,2,3,4,5,6 or one that
has a countably infinite number of elements { }1,3,5,7,... .
A continuous sample space consists of elements forming a continuum. { }x / 2 x 5< <
2
A Venn diagram is used to show relationships between events.
A intersection B = (A ∩ B) = A and B
The outcomes in (A intersection B) belong to set A as well as to set B.
A union B = (A U B) = A alone or B alone or both
Union Formula
For any events A, B, P (A or B) = P (A) + P (B) – P (A intersection B) i.e.
P (A U B) = P (A) + P (B) – P (A ∩ B)
3
cA complement not A A ' A A = = = =
A complement consists of all outcomes outside of A.
Note: P (not A) = 1 – P (A)
Def: Two events are mutually exclusive (disjoint, incompatible) if they do not intersect,
i.e. if they do not occur at the same time. They have no outcomes in common.
When A and B are mutually exclusive, (A ∩ B) = null set = Ø, and P (A and B) = 0.
Thus, when A and B are mutually exclusive, P (A or B) = P (A) + P (B)
(This is exactly the same statement as rule 3 below)
Axioms of Probability
Def: A probability function p is a rule for calculating the probability of an event. The
function p satisfies 3 conditions:
1) 0 ≤ P (A) ≤1, for all events A in the sample space S
2) P (Sample Space S) = 1
3) If A, B, C are mutually exclusive events in the sample space S, then
P(A B C) P(A) P(B) P(C)∪ ∪ = + +
4
The Classical Probability Concept: If there are n equally likely possibilities, of which one
must occur and s are regarded as successes, then the probability of success is s
n
.
Example:
Frequency interpretation of Probability: The probability of an event E is the proportion of
times the event occurs during a long run of repeated experiments.
Example:
Def: A set function assigns a non-negative value to a set.
Ex: N (A) is a set function whose value is the number of elements in A.
Def: An additive set function f is a function for which f (A U B) = f (A) + f (B) when A and
B are mutually exclusive.
N (A) is an additive set function.
Ex: Toss 2 fair dice. Let A be the event that the sum on the two dice is 5. Let B be the
event that the sum on ...
It is a consolidation of basic probability concepts worth understanding before attempting to apply probability concepts for predictions. The material is formed from different sources. ll the sources are acknowledged.
Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 4: Probability
4.1: Basic Concepts of Probability
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
2. CHAPTER 3: PROBABILITY TOPICS
3.1 Terminology
3.2 Independent and Mutually Exclusive Events
3.3 Two Basic Rules of Probability
3.4 Contingency Tables
3.5 Tree and Venn Diagrams
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
3. CHAPTER OBJECTIVES
By the end of this chapter, the student should be able to:
• Understand and use the terminology of probability.
• Determine whether two events are mutually exclusive and
whether two events are independent.
• Calculate probabilities using the Addition Rules and
Multiplication Rules.
• Construct and interpret Contingency Tables.
• Construct and interpret Venn Diagrams.
• Construct and interpret Tree Diagrams.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
4. 3.1 TERMINOLOGY
Probability is a measure that is associated with how certain we are of
outcomes of a particular experiment or activity.
• An experiment is a planned operation carried out under controlled
conditions. If the result is not predetermined, then the experiment
is said to be a chance experiment. Flipping one fair coin twice is
an example of an experiment.
• A result of an experiment is called an outcome.
• The sample space of an experiment is the set of all possible
outcomes. Three ways to represent a sample space are: to list the
possible outcomes, to create a tree diagram, or to create a Venn
diagram.
• The uppercase letter S is used to denote the sample space. For
example, if you flip one fair coin, S = {H, T} where H = heads and T
= tails are the outcomes.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
5. MORE TERMINOLOGY
• An event is any combination of outcomes. Upper case letters like
A and B represent events. For example, if the experiment is to flip
one fair coin, event A might be getting at most one head. The
probability of an event A is written P(A).
• The probability of any outcome is the long-term relative
frequency of that outcome.
• Probabilities are between zero and one, inclusive (that is, zero
and one and all numbers between these values).
• P(A) = 0 means the event A can never happen.
• P(A) = 1 means the event A always happens.
• P(A) = 0.5 means the event A is equally likely to occur or not to
occur.
• For example, if you flip one fair coin repeatedly (from 20 to
2,000 to 20,000 times) the relative frequency of heads
approaches 0.5 (the probability of heads).
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
6. EQUALLY LIKELY EVENTS
• Equally likely means that each outcome of an experiment occurs with
equal probability.
• For example, if you toss a fair, six-sided die, each face (1, 2, 3, 4, 5, or
6) is as likely to occur as any other face.
• If you toss a fair coin, a Head (H) and a Tail (T) are equally likely to
occur.
• If you randomly guess the answer to a true/false question on an exam,
you are equally likely to select a correct answer or an incorrect answer.
• To calculate the probability of an event A when all outcomes in
the sample space are equally likely, count the number of outcomes
for event A and divide by the total number of outcomes in the sample
space.
• For example, if you toss a fair dime and a fair nickel, the sample space
is {HH, TH, HT, TT} where T = tails and H = heads. The sample space
has four outcomes. A = getting one head. There are two outcomes that
meet this condition {HT, TH}, so P(A) = 2/4= 0.5.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
7. LAW OF LARGE NUMBERS
The law of large numbers states that as the number of repetitions of
an experiment is increased, the relative frequency obtained in the
experiment tends to become closer and closer to the theoretical
probability.
• Example
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
8. “OR”, “AND”
"OR" Event:
• An outcome is in the event A OR B if the outcome is in A or is in B
or is in both A and B. For example, let A = {1, 2, 3, 4, 5} and B = {4,
5, 6, 7, 8}. A OR B = {1, 2, 3, 4, 5, 6, 7, 8}. Notice that 4 and 5 are
NOT listed twice.
"AND" Event:
• An outcome is in the event A AND B if the outcome is in both A and
B at the same time. For example, let A and B be {1, 2, 3, 4, 5} and
{4, 5, 6, 7, 8}, respectively. Then A AND B = {4, 5}.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
9. COMPLEMENT
The complement of event A is denoted A′ (read "A prime"). A′ consists
of all outcomes that are NOT in A.
• Notice that P(A) + P(A′) = 1.
• For example, let S = {1, 2, 3, 4, 5, 6} and let A = {1, 2, 3, 4}.
• Then, A′ = {5, 6}.
• P(A) = 4/6,
• P(A′) = 2/6
• and P(A) + P(A′) = 4/6+ 2/6 = 1
• Examples
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
10. CONDITIONAL PROBABILITY
The conditional probability of A given B is written P(A|B). P(A|B) is
the probability that event A will occur given that the event B has
already occurred. A conditional reduces the sample space. We
calculate the probability of A from the reduced sample space B.
• The formula to calculate P(A|B) is
• where P(B) is greater than zero.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
11. CONDITIONAL PROBABILITY
EXAMPLE
For example, suppose we toss one fair, six-sided die. The sample
space S = {1, 2, 3, 4, 5, 6}. Let A = face is 2 or 3 and B = face is even
(2, 4, 6). To calculate P(A|B), we count the number of outcomes 2 or 3
in the sample space B = {2, 4, 6}. Then we divide that by the number of
outcomes B (rather than S).
We get the same result by using the formula. Remember that S has six
outcomes.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
15. 3.2 INDEPENDENT AND
MUTUALLY EXCLUSIVE EVENTS
Independent and mutually exclusive do not mean the same thing.
Independent Events
• Two events are independent if the following are true:
• P(A|B) = P(A)
• P(B|A) = P(B)
• P(A AND B) = P(A)P(B)
• Two events A and B are independent if the knowledge that one
occurred does not affect the chance the other occurs.
• For example, the outcomes of two roles of a fair die are independent
events. The outcome of the first roll does not change the probability for
the outcome of the second roll.
• To show two events are independent, you must show only one of the
above conditions. If two events are NOT independent, then we say
that they are dependent.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
16. SAMPLING WITH OR WITHOUT
REPLACEMENT
Sampling may be done with replacement or without replacement.
• With replacement: If each member of a population is replaced
after it is picked, then that member has the possibility of being
chosen more than once. When sampling is done with replacement,
then events are considered to be independent, meaning the result
of the first pick will not change the probabilities for the second pick.
• Without replacement: When sampling is done without
replacement, each member of a population may be chosen only
once. In this case, the probabilities for the second pick are affected
by the result of the first pick. The events are considered to be
dependent or not independent.
If it is not known whether A and B are independent or dependent,
assume they are dependent until you can show otherwise.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
17. ADDITIONAL EXAMPLES
CARD DECK
You have a fair, well-shuffled deck of 52 cards. It consists of four suits.
The suits are clubs, diamonds, hearts and spades. There are 13 cards
in each suit consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, J (jack), Q (queen),
K (king) of that suit.
• Examples
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
18. MUTUALLY EXCLUSIVE EVENTS
A and B are mutually exclusive events if they cannot occur at the
same time. This means that A and B do not share any outcomes and
P(A AND B) = 0.
• For example, suppose the sample space S = {1, 2, 3, 4, 5, 6, 7, 8,
9, 10}. Let A = {1, 2, 3, 4, 5}, B = {4, 5, 6, 7, 8}, and C = {7, 9}.
• A AND B = {4, 5}. P(A AND B) = 2/10 and is not equal to zero.
Therefore, A and B are not mutually exclusive. A and C do not
have any numbers in common so P(A AND C) = 0. Therefore, A
and C are mutually exclusive.
Are mutually exclusive events dependent or independent? Why?
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
19. 3.3 TWO BASIC RULES OF
PROBABILITY: THE
MULTIPLICATION RULE
The Multiplication Rule
• If A and B are two events defined on a sample space, then: P(A
AND B) = P(B)P(A|B).
• This rule may also be written as:
• (The probability of A given B equals the probability of A and B
divided by the probability of B.)
If A and B are independent, then P(A|B) = P(A). Then P(A AND B) =
P(A|B)P(B) becomes P(A AND B) = P(A)P(B).
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
20. THE ADDITION RULE
The Addition Rule
If A and B are defined on a sample space, then:
• P(A OR B) = P(A) + P(B) - P(A AND B).
• If A and B are mutually exclusive, then P(A AND B) = 0.
• Then P(A OR B) = P(A) + P(B) - P(A AND B) becomes P(A OR B) =
P(A) + P(B).
• Examples
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
21. EXAMPLE 3.15
Carlos plays college soccer. He makes a goal 65% of the time he
shoots. Carlos is going to attempt two goals in a row in the next game.
A = the event Carlos is successful on his first attempt. P(A) = 0.65. B =
the event Carlos is successful on his second attempt. P(B) = 0.65.
Carlos tends to shoot in streaks. The probability that he makes the
second goal GIVEN that he made the first goal is 0.90.
• a. What is the probability that he makes both goals?
• b. What is the probability that Carlos makes either the first goal or
the second goal?
• c. Are A and B independent?
• d. Are A and B mutually exclusive?
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
22. EXAMPLE 3.17
Felicity attends Modesto JC in Modesto, CA. The probability that
Felicity enrolls in a math class is 0.2 and the probability that she enrolls
in a speech class is 0.65. The probability that she enrolls in a math
class GIVEN that she enrolls in speech class is 0.25.
Let: M = math class, S = speech class, M|S = math given speech
• a. What is the probability that Felicity enrolls in math and speech?
Find P(M AND S) = P(M|S)P(S).
• b. What is the probability that Felicity enrolls in math or speech
classes? Find P(M OR S) = P(M) + P(S) - P(M AND S).
• c. Are M and S independent? Is P(M|S) = P(M)?
• d. Are M and S mutually exclusive? Is P(M AND S) = 0?
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
24. 3.4 CONTINGENCY TABLES
A contingency table provides a way of portraying data that can
facilitate calculating probabilities. The table helps in determining
conditional probabilities quite easily. The table displays sample values
in relation to two different variables that may be dependent or
contingent on one another.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
26. 3.5 TREE AND VENN DIAGRAMS
Sometimes, when the probability problems are complex, it can be
helpful to graph the situation. Tree diagrams and Venn diagrams are
two tools that can be used to visualize and solve conditional
probabilities.
Tree Diagrams
• A tree diagram is a special type of graph used to determine the
outcomes of an experiment. It consists of "branches" that are
labeled with either frequencies or probabilities. Tree diagrams can
make some probability problems easier to visualize and solve.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
27. EXAMPLE 3.25: TREE DIAGRAMS
An urn has three red marbles and
eight blue marbles in it. Draw two
marbles, one at a time, this time
without replacement, from the urn.
"Without replacement" means
that you do not put the first ball
back before you select the second
marble.
Following is a tree diagram for this
situation. The branches are
labeled with probabilities instead
of frequencies.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
28. VENN DIAGRAMS
A Venn diagram is a picture that represents the outcomes of an
experiment. It generally consists of a box that represents the sample
space S together with circles or ovals. The circles or ovals represent
events.
• Examples
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics
29. EXAMPLE 3.29
Forty percent of the students at a local college belong to a club and
50% work part time. Five percent of the students work part time and
belong to a club. Draw a Venn diagram showing the relationships. Let C
= student belongs to a club and PT = student works part time.
If a student is selected at random, find
• the probability that the student belongs
to a club.
• the probability that the student works
part time.
• the probability that the student belongs
to a club AND works part time.
• the probability that the student belongs
to a club given that the student works
part time.
• the probability that the student belongs
to a club OR works part time.
Prepared by the College of Coastal Georgia for OpenStax Introductory Statistics