Sampling from a Data
Dr. Hrudaya Kumar Tripathy
What is Sampling?
• The sampling method involves taking a representative selection of the
population and using the data collected from it as research information.
• A sample is a “subgroup of a population”.
• Sampling is a way of obtaining a group of people or objects to study that is
representative of a large population or universe of interest (Stacks &
Hocking, 1999).
Concept of Sampling
[Figure labels: Population, Element, Subject, Sample]
Types of Sampling
• Probability Sampling:
A sampling process in which every individual element in the population
has an opportunity to be chosen for the sample.
• Nonprobability Sampling:
A sampling process in which some individual elements in the population
may not have an opportunity to be chosen for the sample.
Bad ways to sample
Convenience sample: The researcher chooses a sample that is readily available
in some non-random way.
Example: A researcher polls people as they walk by on the street.
Why it's probably biased: The location, time of day, and other factors may
produce a biased sample of people.
Voluntary response sample: The researcher puts out a request for members of
a population to join the sample, and people decide whether or not to be in the
sample.
Example: A TV show host asks his viewers to visit his website and respond to an
online poll.
Why it's probably biased: People who take the time to respond tend to have similarly
strong opinions compared to the rest of the population.
Good ways to sample
Probability Sampling
• Simple Random Sampling
• Stratified Sampling
• Systematic Sampling
• Cluster Sampling
• Multi-Stage Sampling
Simple Random Sampling
• Every element has an equal chance of being selected as part of the sample.
• It is used when we don’t have any prior information about the target
population.
• The sample is selected at random, without any other procedure or criteria.
For example: Random selection of
20 students from a class of 50
students. Each student has an equal
chance of being selected. Here each student's
probability of ending up in the sample is 20/50 = 0.4
(1/50 is only the chance of being picked on a single draw).
Why it's good: Random samples are usually fairly representative since they don't favor
certain members.
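The class example can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the student names, the seed, and the helper name simple_random_sample are assumptions made for the example. Each student's chance of ending up in the sample is n/N = 20/50, whereas 1/50 is the chance of being picked on any single draw.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n distinct elements so that every element is equally likely to be included."""
    rng = random.Random(seed)          # seeded only to make the sketch repeatable
    return rng.sample(population, n)   # sampling without replacement

# Hypothetical class of 50 students
students = [f"student_{i:02d}" for i in range(1, 51)]
chosen = simple_random_sample(students, 20, seed=42)

# Chance that any given student ends up in the sample: n / N
inclusion_probability = 20 / len(students)
print(len(chosen), inclusion_probability)
```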
Stratified Sampling
• This technique divides the elements of the population into small subgroups
(strata) based on similarity, so that elements within a subgroup are
homogeneous while the subgroups differ from one another.
• The elements are then randomly selected from each of these subgroups.
• We need to have prior information about the population to create the
subgroups.
Example: A student council surveys 100
students by getting random samples of 25
freshmen, 25 sophomores, 25 juniors, and 25
seniors.
Why it's good: A stratified sample guarantees that members from each group will be
represented in the sample, so this sampling method is good when we want some members
from every group.
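The student council example can be sketched as follows. The roster of 200 students per year, the tuple layout (name, year), and the helper name stratified_sample are assumptions made purely for illustration.

```python
import random
from collections import defaultdict

def stratified_sample(records, stratum_of, per_stratum, seed=None):
    """Group records into strata, then draw a simple random sample from each stratum."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[stratum_of(record)].append(record)
    sample = []
    for name in sorted(strata):                      # fixed order keeps runs repeatable
        sample.extend(rng.sample(strata[name], per_stratum))
    return sample

# Hypothetical school roster: 200 students in each year
years = ["freshman", "sophomore", "junior", "senior"]
school = [(f"{year}_{i}", year) for year in years for i in range(200)]

survey = stratified_sample(school, lambda r: r[1], per_stratum=25, seed=1)
print(len(survey))
```

Drawing the same number from every stratum matches the example; drawing in proportion to each stratum's size is another common choice.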
Cluster Sampling
• A process for choosing the sample by sections/clusters.
• The entire population is divided into clusters or sections, and then
clusters are randomly selected.
• In the single-stage form, all the elements of each selected cluster are used for sampling.
• Clusters are identified using details such as age, sex, location, etc.
Cluster sampling can be done in the following ways:
• Single Stage Cluster Sampling
Entire clusters are selected randomly for
sampling.
• Two Stage Cluster Sampling
Here we first randomly select clusters,
and then from those selected clusters we
randomly select elements for sampling.
Cluster Sampling (cont..)
Example: An airline company wants to survey its customers one day, so they
randomly select 55 flights that day and survey every passenger on those
flights.
Why it's good: A cluster sample gets every member from some of the
groups, so it's good when each group reflects the population as a whole.
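A minimal sketch of single-stage cluster sampling along the lines of the airline example. The total of 200 flights, the 30 passengers per flight, and the helper name cluster_sample are assumptions made for illustration only.

```python
import random

def cluster_sample(clusters, n_clusters, seed=None):
    """Randomly pick whole clusters and keep every member of each picked cluster."""
    rng = random.Random(seed)
    chosen_ids = rng.sample(sorted(clusters), n_clusters)
    members = [m for cid in chosen_ids for m in clusters[cid]]
    return chosen_ids, members

# Hypothetical day of 200 flights with 30 passengers each
flights = {
    f"flight_{f}": [f"flight_{f}_passenger_{p}" for p in range(30)]
    for f in range(200)
}

chosen_flights, passengers = cluster_sample(flights, 55, seed=7)
print(len(chosen_flights), len(passengers))
```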
Systematic Sampling
• Here the selection of elements is systematic, not random, except for the first
element.
• Elements of the sample are chosen at regular intervals from the population.
• All the elements are first put together in a sequence; because the starting point
is random, each element has an equal chance of being selected.
• Example: A principal takes an alphabetized list of student names and picks a
random starting point. Every 20th student is selected to take a survey.
Systematic Sampling (cont..)
For a sample of size n, we divide our population of size N into n subgroups of k
elements each, where k = N/n.
We select our first element randomly from the first subgroup of k elements.
To select the other elements of the sample, proceed as follows:
If our first element is n1, then the second element is n2 = n1 + k,
the third element is n3 = n2 + k, and so on.
Taking an example of N = 20, n = 5:
The number of elements in each subgroup is k = N/n = 20/5 = 4.
Now, randomly select the first element from the first subgroup.
If we select n1 = 3, then n2 = n1 + k = 3 + 4 = 7, n3 = n2 + k = 7 + 4 = 11,
n4 = 15, and n5 = 19.
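The N = 20, n = 5 walkthrough can be reproduced with a short sketch. The helper name systematic_sample and its start parameter are assumptions for illustration; positions are counted from 1, as in the example, and N is assumed to be a multiple of n.

```python
import random

def systematic_sample(population, n, start=None, seed=None):
    """Pick every k-th element, where k = N // n, from a random start in the first k."""
    N = len(population)
    k = N // n                                        # sampling interval
    if start is None:
        start = random.Random(seed).randint(1, k)     # random 1-based start in 1..k
    positions = [start + i * k for i in range(n)]     # n1, n1 + k, n1 + 2k, ...
    return [population[p - 1] for p in positions]     # convert to 0-based indexing

elements = list(range(1, 21))                         # N = 20, labelled 1..20
picked = systematic_sample(elements, 5, start=3)      # fix n1 = 3 as in the example
print(picked)                                         # [3, 7, 11, 15, 19]
```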
Area Sampling
• A process that depends on geographical/prospective positions.
Multi-Stage Sampling
• It is a combination of two or more of the methods
described earlier.
• The population is divided into multiple clusters, and
these clusters are then further divided and
grouped into various sub-groups based on
similarity.
• One or more clusters can be randomly selected
from each sub-group.
• This process continues until the clusters can’t be
divided any further.
• For example, a country can be divided into states,
cities, and urban and rural areas, and all the areas with
similar characteristics can be merged together to
form sub-groups.
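A minimal two-stage sketch of the idea: first select clusters at random, then select elements at random inside each selected cluster. The states, the household lists, the counts, and the helper name two_stage_sample are all hypothetical; further stages would repeat the same pattern one level down.

```python
import random

def two_stage_sample(clusters, n_clusters, n_per_cluster, seed=None):
    """Stage 1: pick clusters at random. Stage 2: pick elements at random within each."""
    rng = random.Random(seed)
    stage_one = rng.sample(sorted(clusters), n_clusters)
    return {cid: rng.sample(clusters[cid], n_per_cluster) for cid in stage_one}

# Hypothetical country: 10 states, 100 households per state
country = {
    f"state_{s}": [f"state_{s}_household_{h}" for h in range(100)]
    for s in range(10)
}

selected = two_stage_sample(country, n_clusters=3, n_per_cluster=10, seed=3)
for state, households in selected.items():
    print(state, len(households))
```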
QUIZ
1. A restaurant leaves comment cards on all of its tables and encourages
customers to participate in a brief survey to learn about their overall
experience. What type of sampling is this?
A: Convenience sampling B: Voluntary response sampling
Answer: B: Voluntary response sampling
2. A quality control worker at a factory selects the first 10 items she sees
as her sample for the day. What type of sampling is this?
A: Convenience sampling B: Voluntary response sampling
Answer: A: Convenience sampling
3. Each student at a school has a student identification number.
Counselors have a computer generate 50 random identification numbers
and those students are asked to take a survey.
A: Simple random sampling B: Stratified random sampling
C: Cluster random sampling D: Systematic random sampling
Answer: A: Simple random sampling
4. A principal orders t-shirts and wants to check some of them to make
sure they were printed properly. She randomly selects 2 of the 10 boxes
of shirts and checks every shirt in those 2 boxes.
A: Simple random sampling B: Stratified random sampling
C: Cluster random sampling D: Systematic random sampling
Answer: C: Cluster random sampling
5. A school chooses 3 randomly selected athletes from each of its sports
teams to participate in a survey about athletics at the school.
A: Simple random sampling B: Stratified random sampling
C: Cluster random sampling D: Systematic random sampling
Answer: B: Stratified random sampling
6. While students are lined up for school pictures, a teacher passes out a
survey to every 10th student.
A: Simple random sampling B: Stratified random sampling
C: Cluster random sampling D: Systematic random sampling
Answer: D: Systematic random sampling
Nonprobability Sampling
• Convenience Sampling
• Purposive Sampling / Judgmental Sampling
• Quota Sampling
• Referral / Snowball Sampling: A process of building a sample stage by
stage, with each stage reached through recommendations from the previous one.
Convenience Sampling
• Here the samples are selected based on availability.
• This method is used when samples are hard to obtain or costly to collect.
• So samples are selected based on convenience.
• A process of choosing a sample according to suitability.
For example: Researchers prefer this during the initial stages of survey
research, as it’s quick and easy to deliver results.
Purposive Sampling
• This is based on the intention or purpose of the study.
• Only those elements that best suit the purpose of our study
are selected from the population.
• A sample is chosen because it represents a certain purpose.
For example: If we want to understand the thought process of people who
are interested in pursuing a master’s degree, then the selection criterion would be
“Are you interested in a Masters in..?”
All the people who respond with a “No” will be excluded from our sample.
Quota Sampling
• This type of sampling depends on some pre-set standard.
• It aims to select a representative sample from the population.
• The proportion of each characteristic/trait in the sample should be the same as in the population.
• Elements are selected until the exact proportions of certain types of data are
obtained or sufficient data in the different categories has been collected.
For example: If our population has 45% females and 55% males, then our
sample should reflect the same percentages of males and females.
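A minimal sketch of filling quotas for the 45%/55% example. The sample size of 20 (so quotas of 9 and 11), the arrival order, and the helper name quota_sample are assumptions for illustration. People are taken in whatever order they happen to arrive, with no random draw, which is why quota sampling counts as a nonprobability method.

```python
def quota_sample(arrivals, group_of, quotas):
    """Accept people in arrival order until every group's quota is filled."""
    remaining = dict(quotas)
    sample = []
    for person in arrivals:
        group = group_of(person)
        if remaining.get(group, 0) > 0:
            sample.append(person)
            remaining[group] -= 1
        if not any(remaining.values()):   # every quota has reached zero
            break
    return sample

# 45% female / 55% male with a hypothetical sample size of 20 -> quotas of 9 and 11
quotas = {"female": 9, "male": 11}

# Hypothetical stream of passers-by as (name, group) pairs
arrivals = [(f"person_{i}", "male" if i % 3 else "female") for i in range(200)]

sample = quota_sample(arrivals, lambda p: p[1], quotas)
print(len(sample))
```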
Referral / Snowball Sampling
• This technique is used in situations
where the population is completely
unknown and rare.
• Therefore, we take help from the
first element we select from the
population and ask them to recommend
other elements who will fit the
description of the sample needed.
• This referral process goes on,
increasing the size of the sample like a
snowball.
Sampling from a Data Stream
Data Sampling?
Data sampling is a statistical analysis technique used to select,
manipulate and analyze a representative subset of data points
in order to identify patterns and trends in the larger data set
being examined.
Stream Queries
• There are two ways that queries get asked about streams.
• Ad-hoc Queries: Normal queries asked one time about streams.
• Example: What is the maximum value seen so far in stream S?
• Standing Queries: These queries are, in a sense, permanently
executing, and produce outputs at appropriate times. They are queries
that are, in principle, asked about the stream at all times.
• Example: Report each new maximum value ever seen in stream S.
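The two kinds of query can be contrasted with a small sketch built on the maximum-value examples. The class name MaxTracker, its method names, and the sample stream are assumptions for illustration: observe plays the role of the standing query, since it runs on every arrival and reports only when a new maximum appears, while max_so_far answers the ad-hoc question on demand.

```python
class MaxTracker:
    """Keeps only the running maximum, not the stream itself."""

    def __init__(self):
        self.current_max = None

    def observe(self, value):
        """Standing query: return the value if it is a new maximum, else None."""
        if self.current_max is None or value > self.current_max:
            self.current_max = value
            return value
        return None

    def max_so_far(self):
        """Ad-hoc query: the maximum value seen so far."""
        return self.current_max

stream = [3, 1, 4, 1, 5, 9, 2, 6]        # hypothetical stream S
tracker = MaxTracker()
reported = [v for v in stream if tracker.observe(v) is not None]

print(reported)               # [3, 4, 5, 9]
print(tracker.max_so_far())   # 9
```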
Problems on Data Streams
• Types of queries one wants to answer on a stream:
– Filtering a data stream
• Select elements with property x from the stream
– Counting distinct elements
• Number of distinct elements in the last k elements of the stream
– Estimating moments
• Estimate avg./std. dev. of last k elements
– Finding frequent elements
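Several of these queries can be sketched exactly when the last k elements fit in memory. The class name SlidingWindow, the window size, and the sample stream are assumptions for illustration; when k is too large to store, approximate streaming techniques are typically used instead, and those are not shown here.

```python
from collections import deque
import statistics

class SlidingWindow:
    """Holds the last k elements of a stream and answers queries over them."""

    def __init__(self, k):
        self.window = deque(maxlen=k)    # the oldest element is dropped automatically

    def add(self, x):
        self.window.append(x)

    def distinct_count(self):
        return len(set(self.window))

    def mean(self):
        return statistics.fmean(list(self.window))

    def std_dev(self):
        return statistics.pstdev(list(self.window))   # population standard deviation

stream = [1, 2, 2, 3, 3, 3, 4]           # hypothetical stream
evens = [x for x in stream if x % 2 == 0]   # filtering: elements with property "is even"

w = SlidingWindow(k=4)
for x in stream:
    w.add(x)

# The window now holds the last 4 elements: 3, 3, 3, 4
print(evens, w.distinct_count(), w.mean(), w.std_dev())
```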
Applications – (1)
• Mining query streams
• Google wants to know what queries are more frequent today than
yesterday
• Mining click streams
• Yahoo wants to know which of its pages are getting an unusual
number of hits in the past hour
• Mining social network news feeds
• e.g., look for trending topics on Twitter, Facebook
Applications – (2)
• Sensor Networks
• Many sensors feeding into a central controller
• Telephone call records
• Data feeds into customer bills as well as settlements between
telephone companies
• IP packets monitored at a switch
• Gather information for optimal routing
• Detect denial-of-service attacks
28

More Related Content

What's hot

Systematic sampling in probability sampling
Systematic sampling in probability sampling Systematic sampling in probability sampling
Systematic sampling in probability sampling Sachin H
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniquesDr. Adrija Roy
 
PROBABILITY SAMPLING TECHNIQUES
PROBABILITY SAMPLING TECHNIQUESPROBABILITY SAMPLING TECHNIQUES
PROBABILITY SAMPLING TECHNIQUESAzam Ghaffar
 
Sampling in qualitative researc
Sampling in qualitative researcSampling in qualitative researc
Sampling in qualitative researckavita yadav
 
Sampling and its types
Sampling and its typesSampling and its types
Sampling and its typesPrabhleen Arora
 
Sampling
SamplingSampling
SamplingAmna Javed
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniqueschetan1923
 
Collection of Quantitative Data
Collection of Quantitative DataCollection of Quantitative Data
Collection of Quantitative Datareinalyn cacayan
 
Probability sampling techniques
Probability sampling techniquesProbability sampling techniques
Probability sampling techniquesMark Santos
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Dalia El-Shafei
 
Systematic ranom sampling for slide share
Systematic ranom sampling for slide shareSystematic ranom sampling for slide share
Systematic ranom sampling for slide shareIVenkatReddyGaaru
 
Simple random sampling
Simple random samplingSimple random sampling
Simple random samplingsuncil0071
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniquesIrfan Hussain
 
SAMPLING MEANING AND TYPES
SAMPLING  MEANING AND TYPES SAMPLING  MEANING AND TYPES
SAMPLING MEANING AND TYPES Sundar B N
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniquesBharat Paul
 
Introduction to basic concept in sampling and sampling techniques
Introduction to basic concept in sampling and sampling techniquesIntroduction to basic concept in sampling and sampling techniques
Introduction to basic concept in sampling and sampling techniquesJezhabeth Villegas
 

What's hot (20)

Systematic sampling in probability sampling
Systematic sampling in probability sampling Systematic sampling in probability sampling
Systematic sampling in probability sampling
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Sampling
SamplingSampling
Sampling
 
PROBABILITY SAMPLING TECHNIQUES
PROBABILITY SAMPLING TECHNIQUESPROBABILITY SAMPLING TECHNIQUES
PROBABILITY SAMPLING TECHNIQUES
 
Sampling Techniques
Sampling TechniquesSampling Techniques
Sampling Techniques
 
Sampling in qualitative researc
Sampling in qualitative researcSampling in qualitative researc
Sampling in qualitative researc
 
Sampling and its types
Sampling and its typesSampling and its types
Sampling and its types
 
Sampling
SamplingSampling
Sampling
 
Types of Sampling
Types of SamplingTypes of Sampling
Types of Sampling
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Collection of Quantitative Data
Collection of Quantitative DataCollection of Quantitative Data
Collection of Quantitative Data
 
Probability sampling techniques
Probability sampling techniquesProbability sampling techniques
Probability sampling techniques
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
 
Systematic ranom sampling for slide share
Systematic ranom sampling for slide shareSystematic ranom sampling for slide share
Systematic ranom sampling for slide share
 
Simple random sampling
Simple random samplingSimple random sampling
Simple random sampling
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
SAMPLING MEANING AND TYPES
SAMPLING  MEANING AND TYPES SAMPLING  MEANING AND TYPES
SAMPLING MEANING AND TYPES
 
Sampling techniques
Sampling techniquesSampling techniques
Sampling techniques
 
Chapter 8-SAMPLE & SAMPLING TECHNIQUES
Chapter 8-SAMPLE & SAMPLING TECHNIQUESChapter 8-SAMPLE & SAMPLING TECHNIQUES
Chapter 8-SAMPLE & SAMPLING TECHNIQUES
 
Introduction to basic concept in sampling and sampling techniques
Introduction to basic concept in sampling and sampling techniquesIntroduction to basic concept in sampling and sampling techniques
Introduction to basic concept in sampling and sampling techniques
 

Similar to Lecture7.1 data sampling

Sampling techniques in research
Sampling techniques in researchSampling techniques in research
Sampling techniques in researchJulie Atwebembeire
 
Sampling biostatistics.pptx
Sampling biostatistics.pptxSampling biostatistics.pptx
Sampling biostatistics.pptxAhmedMinhas3
 
Sampling and sampling distribution
Sampling and sampling distributionSampling and sampling distribution
Sampling and sampling distributionAli Raza
 
SAMPLING.ppt
SAMPLING.pptSAMPLING.ppt
SAMPLING.pptaigil2
 
Sampling Chapter No 10
Sampling Chapter No 10Sampling Chapter No 10
Sampling Chapter No 10Abdul Basit
 
unit 10 Sampling presentation L- short.ppt
unit 10 Sampling presentation L- short.pptunit 10 Sampling presentation L- short.ppt
unit 10 Sampling presentation L- short.pptMitikuTeka1
 
Methods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeMethods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeAnup Suchak
 
Sampling types-presentation-business research
Sampling types-presentation-business researchSampling types-presentation-business research
Sampling types-presentation-business researchHareesh M
 
An overview of sampling
An overview of samplingAn overview of sampling
An overview of samplingRafath Razia
 
An overview of sampling
An overview of samplingAn overview of sampling
An overview of samplingRafath Razia
 
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptx
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptxSAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptx
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptxkittustudy7
 
Sampling PPT By RG.pdf
Sampling PPT By RG.pdfSampling PPT By RG.pdf
Sampling PPT By RG.pdfDisappointer07
 
SAMPLING PROCEDURES.pptx
SAMPLING PROCEDURES.pptxSAMPLING PROCEDURES.pptx
SAMPLING PROCEDURES.pptxAnalieCabanlit1
 
Sample and sampling techniques
Sample and sampling techniquesSample and sampling techniques
Sample and sampling techniquesAnupam Ghosh
 
Sampling and its types of Biostatistics.
Sampling and its types of Biostatistics.Sampling and its types of Biostatistics.
Sampling and its types of Biostatistics.AabidMir10
 

Similar to Lecture7.1 data sampling (20)

Sampling
SamplingSampling
Sampling
 
Sampling techniques in research
Sampling techniques in researchSampling techniques in research
Sampling techniques in research
 
Sampling biostatistics.pptx
Sampling biostatistics.pptxSampling biostatistics.pptx
Sampling biostatistics.pptx
 
2RM2 PPT.pptx
2RM2 PPT.pptx2RM2 PPT.pptx
2RM2 PPT.pptx
 
Sampling and sampling distribution
Sampling and sampling distributionSampling and sampling distribution
Sampling and sampling distribution
 
SAMPLING.ppt
SAMPLING.pptSAMPLING.ppt
SAMPLING.ppt
 
Sampling
SamplingSampling
Sampling
 
Sampling Chapter No 10
Sampling Chapter No 10Sampling Chapter No 10
Sampling Chapter No 10
 
unit 10 Sampling presentation L- short.ppt
unit 10 Sampling presentation L- short.pptunit 10 Sampling presentation L- short.ppt
unit 10 Sampling presentation L- short.ppt
 
Methods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeMethods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample Size
 
4. Sampling.pptx
4. Sampling.pptx4. Sampling.pptx
4. Sampling.pptx
 
Sampling types-presentation-business research
Sampling types-presentation-business researchSampling types-presentation-business research
Sampling types-presentation-business research
 
An overview of sampling
An overview of samplingAn overview of sampling
An overview of sampling
 
An overview of sampling
An overview of samplingAn overview of sampling
An overview of sampling
 
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptx
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptxSAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptx
SAMPLING METHODS &TYPES, & TECHNIQUES & EXAMPLES.pptx
 
Sampling PPT By RG.pdf
Sampling PPT By RG.pdfSampling PPT By RG.pdf
Sampling PPT By RG.pdf
 
SAMPLING PROCEDURES.pptx
SAMPLING PROCEDURES.pptxSAMPLING PROCEDURES.pptx
SAMPLING PROCEDURES.pptx
 
Sample and sampling techniques
Sample and sampling techniquesSample and sampling techniques
Sample and sampling techniques
 
Sampling and its types of Biostatistics.
Sampling and its types of Biostatistics.Sampling and its types of Biostatistics.
Sampling and its types of Biostatistics.
 
Sampling & Its Types
Sampling & Its TypesSampling & Its Types
Sampling & Its Types
 

More from hktripathy

Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematicshktripathy
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your datahktripathy
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introductionhktripathy
 
Lecture5 virtualization
Lecture5 virtualizationLecture5 virtualization
Lecture5 virtualizationhktripathy
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streamshktripathy
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
Lecture3 business intelligence
Lecture3 business intelligenceLecture3 business intelligence
Lecture3 business intelligencehktripathy
 
Lecture2 big data life cycle
Lecture2 big data life cycleLecture2 big data life cycle
Lecture2 big data life cyclehktripathy
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big datahktripathy
 
Lect9 Decision tree
Lect9 Decision treeLect9 Decision tree
Lect9 Decision treehktripathy
 
Lect8 Classification & prediction
Lect8 Classification & predictionLect8 Classification & prediction
Lect8 Classification & predictionhktripathy
 
Lect7 Association analysis to correlation analysis
Lect7 Association analysis to correlation analysisLect7 Association analysis to correlation analysis
Lect7 Association analysis to correlation analysishktripathy
 
Lect6 Association rule & Apriori algorithm
Lect6 Association rule & Apriori algorithmLect6 Association rule & Apriori algorithm
Lect6 Association rule & Apriori algorithmhktripathy
 
Lect5 principal component analysis
Lect5 principal component analysisLect5 principal component analysis
Lect5 principal component analysishktripathy
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-Ihktripathy
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mininghktripathy
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your datahktripathy
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introductionhktripathy
 

More from hktripathy (18)

Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your data
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
 
Lecture5 virtualization
Lecture5 virtualizationLecture5 virtualization
Lecture5 virtualization
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streams
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
Lecture3 business intelligence
Lecture3 business intelligenceLecture3 business intelligence
Lecture3 business intelligence
 
Lecture2 big data life cycle
Lecture2 big data life cycleLecture2 big data life cycle
Lecture2 big data life cycle
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Lect9 Decision tree
Lect9 Decision treeLect9 Decision tree
Lect9 Decision tree
 
Lect8 Classification & prediction
Lect8 Classification & predictionLect8 Classification & prediction
Lect8 Classification & prediction
 
Lect7 Association analysis to correlation analysis
Lect7 Association analysis to correlation analysisLect7 Association analysis to correlation analysis
Lect7 Association analysis to correlation analysis
 
Lect6 Association rule & Apriori algorithm
Lect6 Association rule & Apriori algorithmLect6 Association rule & Apriori algorithm
Lect6 Association rule & Apriori algorithm
 
Lect5 principal component analysis
Lect5 principal component analysisLect5 principal component analysis
Lect5 principal component analysis
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your data
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
 

Recently uploaded

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 

Recently uploaded (20)

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 

Lecture7.1 data sampling

  • 1. Sampling from a Data Dr. Hrudaya Kumar Tripathy
  • 2. What is Sampling? • The sampling method involves taking a representative selection of the population and using the data collected from it as research information. • A sample is a “subgroup of a population”. • Sampling is a way of obtaining a group of people or objects to study that is representative of a large population or universe of interest (Stacks & Hocking, 1999).
  • 3. Concept of Sampling (diagram labels: Population, Element, Subject, Sample)
  • 4. Types of Sampling • Probability Sampling: A sampling process in which every individual element in the population has an opportunity to be chosen for the sample. • Nonprobability Sampling: A sampling process in which some individual elements in the population may not have an opportunity to be chosen for the sample.
  • 5. Nonprobability Sampling: Bad ways to sample • Convenience sample: The researcher chooses a sample that is readily available in some non-random way. Example: A researcher polls people as they walk by on the street. Why it's probably biased: The location, the time of day, and other factors may produce a biased sample of people. • Voluntary response sample: The researcher puts out a request for members of a population to join the sample, and people decide whether or not to be in the sample. Example: A TV show host asks his viewers to visit his website and respond to an online poll. Why it's probably biased: People who take the time to respond tend to have stronger opinions than the rest of the population.
  • 6. Probability Sampling: Good ways to sample • Simple Random Sampling • Stratified Sampling • Systematic Sampling • Cluster Sampling • Multi-Stage Sampling
  • 7. Simple Random Sampling • Every element has an equal chance of being selected as part of the sample. • It is used when we don’t have any prior information about the target population. • The sample is selected at random, without any other procedure or criteria. For example: Random selection of 20 students from a class of 50 students. Each student has an equal chance of being selected. Here the probability that a given student is included in the sample is 20/50 = 2/5 (1/50 is only the chance of being picked on a single draw from the full class). Why it's good: Random samples are usually fairly representative since they don't favor certain members.
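The class example can be sketched in a few lines of Python. This is only an illustrative sketch: the student labels, the helper name `simple_random_sample`, and the fixed seed are assumptions made for the example, not part of the slides.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n distinct elements; every subset of size n is equally likely."""
    rng = random.Random(seed)  # local generator so the draw is reproducible
    return rng.sample(population, n)

# Hypothetical class of 50 students.
students = [f"student_{i:02d}" for i in range(1, 51)]
chosen = simple_random_sample(students, 20, seed=7)

# Each student's chance of ending up in the sample is n / N.
inclusion_probability = 20 / len(students)
print(len(chosen), inclusion_probability)
```

`random.sample` draws without replacement, so no student can appear twice in the sample.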
  • 8. Stratified Sampling • This technique divides the elements of the population into small subgroups (strata) based on similarity, in such a way that the elements within a subgroup are homogeneous while the subgroups themselves differ from one another. • The elements are then randomly selected from each of these subgroups. • We need to have prior information about the population to create the subgroups. Example: A student council surveys 100 students by getting random samples of 25 freshmen, 25 sophomores, 25 juniors, and 25 seniors. Why it's good: A stratified sample guarantees that members from each group will be represented in the sample, so this sampling method is good when we want some members from every group.
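A minimal sketch of the student council example, assuming each student record carries its class year. The data, the helper name `stratified_sample`, and the equal allocation of 25 per stratum are illustrative assumptions.

```python
import random
from collections import defaultdict

def stratified_sample(elements, stratum_of, per_stratum, seed=None):
    """Group elements into strata, then draw a simple random sample from each."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for element in elements:
        strata[stratum_of(element)].append(element)
    sample = []
    for name in sorted(strata):  # sorted only to make the output order stable
        sample.extend(rng.sample(strata[name], per_stratum))
    return sample

# Hypothetical school: 400 students, 100 in each class year.
years = ["freshman", "sophomore", "junior", "senior"]
students = [(f"s{i}", years[i % 4]) for i in range(400)]

survey = stratified_sample(students, lambda s: s[1], per_stratum=25, seed=1)
print(len(survey))
```

Drawing the same number from every stratum matches the slide's example; allocating in proportion to stratum size is another common choice.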
  • 9. Cluster Sampling • A process that chooses the sample by sections (clusters). • The entire population is divided into clusters or sections, and then clusters are randomly selected. • All the elements of a selected cluster are used for sampling. • Clusters are identified using details such as age, sex, location, etc. Cluster sampling can be done in the following ways: • Single-Stage Cluster Sampling • Two-Stage Cluster Sampling
  • 10. • Single-Stage Cluster Sampling: Entire clusters are selected randomly, and every element of each selected cluster is sampled. • Two-Stage Cluster Sampling: First we randomly select clusters, and then from those selected clusters we randomly select elements for the sample.
  • 11. Cluster Sampling (cont.) Example: An airline company wants to survey its customers one day, so they randomly select 55 flights that day and survey every passenger on those flights. Why it's good: A cluster sample gets every member from some of the groups, so it's good when each group reflects the population as a whole.
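The two variants can be sketched as follows. The flight and passenger identifiers, the function names, and the figures of 200 flights with 30 passengers each are assumptions chosen to mirror the airline example.

```python
import random

def single_stage_cluster_sample(clusters, n_clusters, seed=None):
    """Pick whole clusters at random and keep every member of each one."""
    rng = random.Random(seed)
    picked = rng.sample(sorted(clusters), n_clusters)
    return [member for c in picked for member in clusters[c]]

def two_stage_cluster_sample(clusters, n_clusters, n_per_cluster, seed=None):
    """Pick clusters at random, then pick members at random inside each."""
    rng = random.Random(seed)
    picked = rng.sample(sorted(clusters), n_clusters)
    sample = []
    for c in picked:
        members = clusters[c]
        sample.extend(rng.sample(members, min(n_per_cluster, len(members))))
    return sample

# Hypothetical day of flying: 200 flights with 30 passengers each.
flights = {
    f"FL{i:03d}": [f"FL{i:03d}-p{j}" for j in range(30)] for i in range(200)
}

everyone = single_stage_cluster_sample(flights, 55, seed=3)
some = two_stage_cluster_sample(flights, 55, 10, seed=3)
print(len(everyone), len(some))
```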
  • 12. Systematic Sampling • Here the selection of elements is systematic rather than random, except for the first element. • Elements of the sample are chosen at regular intervals from the population. • All the elements are first put together in a sequence in which each element has an equal chance of being selected. • Example: A principal takes an alphabetized list of student names and picks a random starting point. Every 20th student is selected to take a survey.
  • 13. Systematic Sampling (cont.) For a sample of size n, we divide our population of size N into n subgroups of k = N/n elements each. We select our first element randomly from the first subgroup of k elements. The other elements of the sample follow at intervals of k: if the first element is n1, then the second element is n2 = n1 + k, the third element is n3 = n2 + k, and so on. Example with N = 20, n = 5: the number of elements in each subgroup is k = N/n = 20/5 = 4. Now randomly select the first element from the first subgroup. If we select n1 = 3, then n2 = n1 + k = 3 + 4 = 7, n3 = n2 + k = 7 + 4 = 11, n4 = 15, and n5 = 19.
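The N = 20, n = 5 example can be reproduced with a short sketch. The function name `systematic_sample` and its optional `first` argument (a 1-based position inside the first subgroup) are assumptions made for illustration.

```python
import random

def systematic_sample(population, n, first=None, seed=None):
    """Take every k-th element, k = N // n, starting inside the first subgroup.

    `first` is a 1-based position in the first subgroup; if omitted it is
    drawn at random, which is the only random step of the method.
    """
    k = len(population) // n
    if first is None:
        first = random.Random(seed).randint(1, k)
    return [population[first - 1 + i * k] for i in range(n)]

population = list(range(1, 21))  # elements numbered 1..20

fixed = systematic_sample(population, 5, first=3)
print(fixed)  # [3, 7, 11, 15, 19]

random_start = systematic_sample(population, 5, seed=0)
print(random_start)
```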
  • 14. Multi-Stage Sampling (Area Sampling) • It is a combination of one or more of the methods described earlier. • The population is divided into multiple clusters, and these clusters are then further divided and grouped into various subgroups based on similarity. • One or more clusters can be randomly selected from each subgroup. • This process continues until the clusters can’t be divided any further. • For example, a country can be divided into states, cities, and urban and rural areas, and all the areas with similar characteristics can be merged together to form subgroups. Area sampling: a process that depends on geographical/prospective positions.
  • 15. QUIZ 1. A restaurant leaves comment cards on all of its tables and encourages customers to participate in a brief survey to learn about their overall experience. What type of sampling is this? A: Convenience sampling B: Voluntary response sampling. Answer: B: Voluntary response sampling 2. A quality control worker at a factory selects the first 10 items she sees as her sample for the day. What type of sampling is this? A: Convenience sampling B: Voluntary response sampling. Answer: A: Convenience sampling
  • 16. 3. Each student at a school has a student identification number. Counselors have a computer generate 50 random identification numbers and those students are asked to take a survey. A: Simple random sampling B: Stratified random sampling C: Cluster random sampling D: Systematic random sampling. Answer: A: Simple random sampling 4. A principal orders t-shirts and wants to check some of them to make sure they were printed properly. She randomly selects 2 of the 10 boxes of shirts and checks every shirt in those 2 boxes. A: Simple random sampling B: Stratified random sampling C: Cluster random sampling D: Systematic random sampling. Answer: C: Cluster random sampling
  • 17. 5. A school chooses 3 randomly selected athletes from each of its sports teams to participate in a survey about athletics at the school. A: Simple random sampling B: Stratified random sampling C: Cluster random sampling D: Systematic random sampling. Answer: B: Stratified random sampling 6. While students are lined up for school pictures, a teacher passes out a survey to every 10th student. A: Simple random sampling B: Stratified random sampling C: Cluster random sampling D: Systematic random sampling. Answer: D: Systematic random sampling
  • 18. Nonprobability Sampling • Convenience Sampling • Purposive Sampling/Judgmental Sampling • Quota Sampling • Referral/Snowball Sampling: The process of building a sample stage by stage, moving from one element to the next on recommendation.
  • 19. Convenience Sampling • Here the samples are selected based on availability. • This method is used when sample elements are hard to find and costly to obtain. • So the samples are selected based on convenience. • It is the process of choosing a sample according to suitability. For example: Researchers prefer this during the initial stages of survey research, as it’s quick and easy to deliver results.
  • 20. Purposive Sampling • This is based on the intention or the purpose of the study. • Only those elements that best suit the purpose of our study are selected from the population. • A sample is chosen because it represents a certain purpose. For example: If we want to understand the thought process of people who are interested in pursuing a master’s degree, then the selection criterion would be “Are you interested in a Master's in..?” All the people who respond with a “No” will be excluded from our sample.
  • 21. Quota Sampling • This type of sampling depends on some pre-set standard. • It selects a representative sample from the population. • The proportion of each characteristic/trait in the sample should be the same as in the population. • Elements are selected until the exact proportions of certain types of data are obtained or sufficient data in the different categories is collected. For example: If our population has 45% females and 55% males, then our sample should reflect the same percentages of males and females.
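A rough sketch of filling quotas for the 45%/55% example with a sample of 100. The respondent list, the function name `quota_sample`, and the choice to accept people in arrival order are assumptions; accepting in arrival order is what makes this a nonprobability method.

```python
def quota_sample(candidates, category_of, quotas):
    """Accept candidates in arrival order until every category quota is full."""
    remaining = dict(quotas)
    sample = []
    for candidate in candidates:
        category = category_of(candidate)
        if remaining.get(category, 0) > 0:
            sample.append(candidate)
            remaining[category] -= 1
        if not any(remaining.values()):
            break  # all quotas filled, stop collecting
    return sample

# Hypothetical respondents in the order they happen to be encountered.
respondents = [
    (f"r{i}", "female" if i % 3 == 0 else "male") for i in range(300)
]

# Population is 45% female and 55% male, so a sample of 100 needs 45 and 55.
quotas = {"female": 45, "male": 55}
sample = quota_sample(respondents, lambda r: r[1], quotas)
print(len(sample))
```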
  • 22. Referral/Snowball Sampling • This technique is used in situations where the population is completely unknown and rare. • Therefore we take help from the first element that we select from the population and ask them to recommend other elements who will fit the description of the sample needed. • This referral technique goes on, increasing the size of the sample like a snowball.
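The referral process can be sketched as a breadth-first walk over a referral graph. The names, the `referrals` mapping, and the `max_size` stopping rule are hypothetical; in practice referrals are collected by asking each participant, not read from a known table.

```python
from collections import deque

def snowball_sample(seeds, referrals, max_size):
    """Grow a sample by following referrals outward from the seed elements."""
    sample = []
    seen = set(seeds)
    queue = deque(seeds)
    while queue and len(sample) < max_size:
        person = queue.popleft()
        sample.append(person)
        for referred in referrals.get(person, []):
            if referred not in seen:  # never recruit the same person twice
                seen.add(referred)
                queue.append(referred)
    return sample

# Hypothetical referral answers: who each participant recommends.
referrals = {
    "ana": ["bo", "cy"],
    "bo": ["cy", "di"],
    "cy": ["ed"],
    "di": [],
    "ed": ["ana"],
}

sample = snowball_sample(["ana"], referrals, max_size=4)
print(sample)  # ['ana', 'bo', 'cy', 'di']
```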
  • 23. Sampling from a Data Stream
  • 24. Data Sampling? Data sampling is a statistical analysis technique used to select, manipulate and analyze a representative subset of data points in order to identify patterns and trends in the larger data set being examined.
  • 25. Stream Queries • There are two ways that queries get asked about streams. • Ad-hoc Queries: Normal queries asked one time about streams. • Example: What is the maximum value seen so far in stream S? • Standing Queries: These queries are, in a sense, permanently executing and produce outputs at appropriate times; in principle, they are asked about the stream at all times. • Example: Report each new maximum value ever seen in stream S.
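The difference between the two query styles can be sketched with the maximum-value example. The class name `MaxTracker`, its method names, and the sample stream are assumptions for illustration only.

```python
class MaxTracker:
    """Keeps just enough state (the running maximum) to answer both queries."""

    def __init__(self):
        self.current_max = None

    def observe(self, value):
        """Standing query: called on every arrival, reports only new maxima."""
        if self.current_max is None or value > self.current_max:
            self.current_max = value
            return value
        return None

    def query_max(self):
        """Ad-hoc query: asked once, answered from the stored state."""
        return self.current_max

stream = [3, 1, 4, 1, 5, 9, 2, 6]
tracker = MaxTracker()

# The standing query produces an output each time a new maximum arrives.
reported = [v for v in stream if tracker.observe(v) is not None]
print(reported)             # [3, 4, 5, 9]
print(tracker.query_max())  # 9
```

Both queries are cheap here because a single stored value summarizes the whole stream; an ad-hoc query that was not anticipated may need state that was never kept.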
  • 26. Problems on Data Streams • Types of queries one wants to answer on a stream: – Filtering a data stream • Select elements with property x from the stream – Counting distinct elements • Number of distinct elements in the last k elements of the stream – Estimating moments • Estimate avg./std. dev. of the last k elements – Finding frequent elements
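For small k, several of these queries can be answered exactly by keeping the last k elements in memory, as in the sketch below. The class `SlidingWindow` and helper `filter_stream` are hypothetical names, and storing the window exactly is a simplifying assumption: when k is too large to store, approximate methods are used instead.

```python
import statistics
from collections import deque

def filter_stream(stream, has_property):
    """Lazily yield only the elements that satisfy the property."""
    return (x for x in stream if has_property(x))

class SlidingWindow:
    """Holds the last k elements of a stream and answers exact queries on them."""

    def __init__(self, k):
        self.window = deque(maxlen=k)  # oldest element drops out automatically

    def add(self, x):
        self.window.append(x)

    def distinct_count(self):
        return len(set(self.window))

    def mean(self):
        return statistics.fmean(self.window)

    def std_dev(self):
        return statistics.pstdev(self.window)

stream = [5, 1, 2, 2, 3, 3]
w = SlidingWindow(k=4)
for x in stream:
    w.add(x)  # after the loop the window holds 2, 2, 3, 3

evens = list(filter_stream(stream, lambda x: x % 2 == 0))
print(evens, w.distinct_count(), w.mean(), w.std_dev())
```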
  • 27. Applications – (1) • Mining query streams • Google wants to know what queries are more frequent today than yesterday • Mining click streams • Yahoo wants to know which of its pages are getting an unusual number of hits in the past hour • Mining social network news feeds • e.g., look for trending topics on Twitter, Facebook
  • 28. Applications – (2) • Sensor Networks • Many sensors feeding into a central controller • Telephone call records • Data feeds into customer bills as well as settlements between telephone companies • IP packets monitored at a switch • Gather information for optimal routing • Detect denial-of-service attacks