DATA PREPARATION | FREQUENCY DISTRIBUTION | CROSS-TABULATION
DATA COLLECTION
PREPARATION
DATA
• Data is anything that has been produced or
created during research. Primary data is data that
you have created yourself, but your data sets can
also contain data that has been created by
other researchers.
WHAT IS DATA COLLECTION?
• It is the process of gathering and measuring
information on variables of interest, in an
established systematic fashion that enables
one to answer stated research questions,
test hypotheses, and evaluate outcomes.
METHODS OF DATA COLLECTION
• A. Interview (Direct) Method – a
method of person-to-person exchange
between the interviewer and the
interviewee.
POSITIVE
• 1) It provides consistent and more precise
information, since the interviewer may give
clarification.
• 2) Questions may be repeated or
modified to suit the interviewee’s level of
understanding.
NEGATIVE
• 1) Time-consuming
• 2) Expensive
• 3) Limited field coverage
• B. Questionnaire (Indirect) Method – in
this method, written responses are
given to prepared questions. A
questionnaire is used to elicit answers
to the problems of the study.
Questionnaires may be mailed or
hand-carried.
POSITIVE
• 1) Inexpensive
• 2) Can cover a wide area in a shorter span of time.
• 3) Respondents may feel a greater sense of
freedom to express views and opinions because
their anonymity is maintained.
NEGATIVE
• 1) There’s a strong possibility of non-response,
especially when questionnaires are mailed.
• 2) Questions not easily understood may not be
answered.
• C. Registration Method – this method of
gathering information is enforced by law.
E.g.
• Registration of births
• Deaths
• Vehicles
• Licenses
• Number of tourists in a City
POSITIVE
• 1) Information is kept systematized.
• 2) Information is always made available to
the public.
• D. Observation Method – the investigator
observes the behavior of the subject/respondent.
It is used when the subjects cannot talk or write.
POSITIVE
The recording of behavior at the appropriate time
and situation is made possible.
• E. Experiment Method - this method is used when
the objective is to determine the cause-and-effect
relationship of certain phenomena under
controlled conditions. It is usually used by scientific
researchers.
DATA COLLECTION
PREPARATION
1. MAKE LOGISTICS ARRANGEMENTS.
• In order to make logistics arrangements, you will
have to (1) set up central local headquarters and (2)
contact local authorities in the areas where the survey will be
carried out.
2. PREPARE THE QUESTIONNAIRE AND
TRAINING MATERIALS.
• You must pre-test the translated
questionnaire in the field.
Specifically, the pre-test should answer the following
questions:
• Are respondents willing to answer questions in the way
you have asked them?
• Are any of the questions particularly difficult to answer or
do they address sensitive issues?
• Are the questions well understood by the respondents?
• Is it necessary to create new codes for common answers
which were not included in the original questionnaire?
3. CHOOSING AND PREPARING THE
EQUIPMENT
• Equipment must be purchased well in advance of
the survey. Examples are weighing scales and
length/height boards.
4. QUESTIONNAIRE CHECKING AGAIN
• Questionnaire checking involves eliminating
unacceptable questionnaires. A questionnaire may be
unacceptable because it is incomplete, its instructions were
not followed, its answers show little variance, pages are
missing, it arrived past the cut-off date, or the
respondent was not qualified.
5. COLLECTING DATA AND ANALYSIS
• Editing: Editing looks to correct illegible, incomplete,
inconsistent and ambiguous answers.
• Coding: Coding typically assigns alpha or numeric
codes to answers that do not already have them so that
statistical techniques can be applied.
• Transcribing: Transcribing data involves transferring data
so as to make it accessible to people or applications for
further processing.
• Cleaning: Cleaning reviews data for consistency.
Inconsistencies may arise from faulty logic or from
out-of-range or extreme values.
• Statistical adjustments: Statistical adjustments apply
to data that require weighting and scale
transformations.
• Analysis strategy selection: Finally, selection of a data
analysis strategy is based on earlier work in designing
the research project but is finalized after consideration
of the characteristics of the data that has been
gathered.
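The coding and cleaning steps above can be sketched in a few lines of Python. This is only an illustration: the code book, the sample answers, and the acceptable age range are assumptions made for the example, not values from the survey described here.

```python
# Hypothetical code book: maps a text answer to a numeric code.
CODE_BOOK = {"yes": 1, "no": 2, "don't know": 9}

def code_answer(raw):
    """Coding: return the numeric code for a raw answer, or None if it has no code."""
    return CODE_BOOK.get(raw.strip().lower())

def out_of_range(values, low, high):
    """Cleaning: return (index, value) pairs that fall outside [low, high]."""
    return [(i, v) for i, v in enumerate(values) if not low <= v <= high]

answers = ["Yes", " no", "NO ", "maybe", "don't know"]
coded = [code_answer(a) for a in answers]
print(coded)  # [1, 2, 2, None, 9]

ages = [34, 27, 210, 45, -3]          # assumed acceptable range: 0 to 120
flagged = out_of_range(ages, 0, 120)
print(flagged)  # [(2, 210), (4, -3)]
```

Answers that come back as None are candidates for a new code or for editing; flagged values should be checked against the original questionnaire.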
DATA PREPARATION
WHAT IS DATA PREPARATION?
• Data preparation is about constructing a dataset
from one or more data sources to be used for
exploration and modeling. It is a solid practice to
start with an initial dataset to get familiar with the
data, to discover first insights into the data and
have a good understanding of any possible data
quality issues.
DATA PREPARATION
• Organizing the data correctly can save a lot of time
and prevent mistakes.
• Most researchers choose to use a database or
statistical analysis program (Microsoft Excel, SPSS)
that they can format to fit their needs in order to
organize their data effectively.
• Once the data has been entered, it is crucial that
the researcher check the data for accuracy.
STEPS IN DATA PREPARATION
• 1. Checking the Data For Accuracy
• As soon as data is received you should screen it for
accuracy. In some circumstances doing this right away will
allow you to go back to the sample to clarify any problems or
errors.
• Are the responses legible/readable?
• Are all important questions answered?
• Are the responses complete?
• Is all relevant contextual information included (e.g., date,
time, place, researcher)?
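A screening pass like the one described by these questions might look as follows. The field names and the two records are hypothetical; the sketch only checks that required answers and contextual information are present.

```python
# Hypothetical required fields: contextual information plus two key questions.
REQUIRED = ["respondent_id", "date", "place", "q1", "q2"]

def screen(record):
    """Return the required fields that are absent or blank in one record."""
    return [f for f in REQUIRED if str(record.get(f, "")).strip() == ""]

records = [
    {"respondent_id": 1, "date": "2024-03-01", "place": "Site A", "q1": "yes", "q2": "no"},
    {"respondent_id": 2, "date": "", "place": "Site A", "q1": "yes"},
]

problems = {r["respondent_id"]: screen(r) for r in records}
print(problems)  # {1: [], 2: ['date', 'q2']}
```

Running the check as soon as data arrive leaves time to go back to the respondent for the missing items.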
• 2. EDITING
• Editing detects errors and omissions and corrects them as
far as possible.
• Purpose:
• To ensure accuracy
• To bring about consistency with other information
• To make sure the data are uniformly entered
• To make sure the data are complete and arranged to simplify
coding and tabulation
STEPS IN DATA PREPARATION
• 3. ENTERING THE DATA INTO THE COMPUTER
• The analyst should use a procedure called double entry: the data
are entered twice, and the two versions are compared.
• This double entry procedure significantly reduces entry
errors.
• An alternative is to enter the data once and set up a
procedure for checking the data for accuracy.
• EXAMPLE: you might spot check records on a random basis.
• After entry, you will use various programs to summarize the
data, which allows you to check that all the data are within
acceptable limits and boundaries.
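As a rough sketch of the double entry idea, the two entry passes can be compared cell by cell and every disagreement listed for checking against the paper form. The records and field names below are made up for illustration.

```python
def compare_entries(first_pass, second_pass):
    """Return (row, field, value1, value2) for every cell where the two passes disagree."""
    mismatches = []
    for row, (a, b) in enumerate(zip(first_pass, second_pass)):
        for field in a:
            if a[field] != b.get(field):
                mismatches.append((row, field, a[field], b.get(field)))
    return mismatches

# Hypothetical data typed in twice by two different people.
first = [{"id": 1, "age": 34, "cars": 2}, {"id": 2, "age": 27, "cars": 1}]
second = [{"id": 1, "age": 34, "cars": 2}, {"id": 2, "age": 72, "cars": 1}]

mismatches = compare_entries(first, second)
print(mismatches)  # [(1, 'age', 27, 72)]
```

The comparison only shows where the passes differ; deciding which value is right still requires going back to the source document.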
STEPS IN DATA PREPARATION
• 4. DATA TRANSFORMATIONS
• Once the data have been entered it is almost always
necessary to transform the raw data into variables that are
usable in the analyses.
• Missing values
• Many analysis programs automatically treat blank values as missing.
In others, you need to designate specific values to represent missing
values.
• Item reversals
• On scales and surveys, we sometimes use reversal items to help
reduce the possibility of a response set. When you analyze the data,
you want all scores for scale items to be in the same direction where
high scores mean the same thing and low scores mean the same
thing. In these cases, you have to reverse the ratings for some of the
scale items.
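The two transformations above can be illustrated with a short Python sketch. The missing-value code (-9) and the 1-to-5 rating scale are assumptions chosen for the example; a real study would take them from its own code book.

```python
MISSING = -9        # assumed code that data entry used for "no answer"
LOW, HIGH = 1, 5    # assumed rating scale: 1 (lowest) to 5 (highest)

def clean(value):
    """Turn the designated missing-value code into None."""
    return None if value == MISSING else value

def reverse(value):
    """Reverse a rating so that 1 becomes 5, 2 becomes 4, and so on."""
    return None if value is None else LOW + HIGH - value

raw = [5, 2, -9, 1]
cleaned = [clean(v) for v in raw]
reversed_scores = [reverse(v) for v in cleaned]
print(cleaned)          # [5, 2, None, 1]
print(reversed_scores)  # [1, 4, None, 5]
```

Converting missing codes before reversing matters: reversing -9 directly would produce 15, which looks like a valid number but means nothing.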
STEPS IN DATA PREPARATION
• Scale totals
• Once you've transformed any individual scale items
you will often want to add or average across
individual items to get a total score for the scale.
• Categories
• For many variables you will want to collapse them
into categories. For instance, you may want to
collapse income estimates (in dollar amounts) into
income ranges.
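A minimal sketch of both steps, assuming items already cleaned and reversed, with None marking a missing answer. The income cut points and labels are hypothetical.

```python
from bisect import bisect_right

def scale_mean(items):
    """Average the answered items of a scale; return None if nothing was answered."""
    answered = [v for v in items if v is not None]
    return sum(answered) / len(answered) if answered else None

# Hypothetical cut points (lower edges of the 2nd, 3rd and 4th ranges).
CUTS = [20000, 50000, 100000]
LABELS = ["under 20,000", "20,000-49,999", "50,000-99,999", "100,000 and over"]

def income_range(amount):
    """Collapse a dollar amount into one of the income ranges."""
    return LABELS[bisect_right(CUTS, amount)]

print(scale_mean([4, 5, None, 3]))  # 4.0
print(income_range(19999))          # under 20,000
print(income_range(20000))          # 20,000-49,999
print(income_range(250000))         # 100,000 and over
```

Averaging rather than summing keeps scale scores comparable when some respondents skipped an item; whether that is appropriate depends on the scale.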
FREQUENCY DISTRIBUTION
TABLE
FREQUENCY DISTRIBUTION TABLE
• Frequency tells you how often something occurs.
The frequency of an observation in statistics tells you
the number of times the observation occurs in the
data.
• Frequency distribution tables can show
either categorical variables (sometimes called
qualitative variables) or quantitative
variables (sometimes called numeric variables). You
can think of categorical variables as being
categories (like eye color or brand of dog food)
and quantitative variables as being numbers.
GROUPED AND UNGROUPED DATA
• UNGROUPED FREQUENCY
DISTRIBUTION
• The data obtained in original form are
called raw data or ungrouped data.
• In an ungrouped frequency distribution,
each distinct value is listed in order, together
with its frequency.
UNGROUPED DATA
• In each of 20 homes, people were asked how many
cars were registered to their households. The results
were recorded as follows:
• 1, 2, 1, 0, 3, 4, 0, 1, 1, 1, 2, 2, 3, 2, 3, 2, 1, 4, 0, 0
Number of cars (x)   Tally     Frequency (f)
0                    ||||      4
1                    ||||||    6
2                    |||||     5
3                    |||       3
4                    ||        2
Table 1. Frequency table for the number of cars registered in each household
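The same frequency table can be produced from the raw survey results with Python's standard `collections.Counter`. The variable names are illustrative; the data are the 20 household answers listed above.

```python
from collections import Counter

cars = [1, 2, 1, 0, 3, 4, 0, 1, 1, 1, 2, 2, 3, 2, 3, 2, 1, 4, 0, 0]

freq = Counter(cars)   # maps each value x to its frequency f

print("x  tally   f")
for x in sorted(freq):
    # One "|" per observation stands in for the hand-drawn tally marks.
    print(f"{x}  {'|' * freq[x]:<7} {freq[x]}")

total = sum(freq.values())
print("Total:", total)
```

The total of the frequency column should equal the number of households surveyed (20), which is a quick check that no answer was dropped.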
GROUPED DATA
• GROUPED DATA
• Values are gathered into ranges (class intervals), and the
frequency of each range is recorded.
• CLASS FREQUENCY
• Number of observations belonging to a class interval.
• CLASS INTERVAL
• Refers to the grouping defined by a lower limit and upper limit
• CLASS BOUNDARIES
• The lower and the upper true limits
• CLASS MARKS
• Midpoint of each class interval and it is obtained by getting the
average of the lower class limit and the upper class limit
• CLASS SIZE
• difference between the upper class boundary and lower class
boundary of a class interval.
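These definitions can be checked on a single class interval. The sketch below assumes the data are recorded to the nearest whole unit, so the true limits lie half a unit beyond the stated limits; the function name and the sample interval are illustrative.

```python
def class_summary(lower, upper, unit=1):
    """Boundaries, mark and size of a class interval with the given class limits.

    `unit` is the precision of the recorded data (assumed 1, e.g. whole minutes).
    """
    half = unit / 2
    lower_boundary, upper_boundary = lower - half, upper + half
    return {
        "boundaries": (lower_boundary, upper_boundary),
        "mark": (lower + upper) / 2,              # midpoint of the class limits
        "size": upper_boundary - lower_boundary,  # difference of the boundaries
    }

summary = class_summary(360, 369)
print(summary)
# {'boundaries': (359.5, 369.5), 'mark': 364.5, 'size': 10.0}
```

Note that the class size comes out as 10, not 9: it is measured between the boundaries, not between the stated limits.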
GROUPED DATA
• Thirty AA batteries were tested to determine how
long they would last. The results, to the nearest
minute, were recorded as follows:
• 423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392,
408, 431, 401, 363, 391, 405, 382, 400, 381, 399, 415,
428, 422, 396, 372, 410, 419, 386, 390
GROUPED DATA
• The lowest value is 363 and the highest is 431.
• Using the given data and a class width of 10, the
first class interval is 360 to 369 and includes
363 (the lowest value). Remember, there should
always be enough class intervals so that the highest
value is included.
• The ideal number of class intervals (nc) is 5 to 20.
GROUPED DATA
Battery life, minutes (x)   Tally      Frequency (f)
360–369                     ||         2
370–379                     |||        3
380–389                     |||||      5
390–399                     |||||||    7
400–409                     |||||      5
410–419                     ||||       4
420–429                     |||        3
430–439                     |         1
Total                                  30
Table 3. Life of AA batteries, in minutes
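The grouping can be reproduced from the 30 raw battery readings. This sketch assumes the same choices as the table: the first class starts at 360 and every class is 10 minutes wide. The constant names are illustrative.

```python
from collections import Counter

battery = [423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392,
           408, 431, 401, 363, 391, 405, 382, 400, 381, 399, 415,
           428, 422, 396, 372, 410, 419, 386, 390]

START, WIDTH = 360, 10   # lower limit of the first class and the class width

# Index of the class each value falls into: 0 for 360-369, 1 for 370-379, ...
counts = Counter((v - START) // WIDTH for v in battery)
n_classes = (max(battery) - START) // WIDTH + 1

table = [(START + k * WIDTH, START + k * WIDTH + WIDTH - 1, counts[k])
         for k in range(n_classes)]

for lower, upper, f in table:
    print(f"{lower}-{upper}  {f}")
print("Total", sum(f for _, _, f in table))
```

Computing the number of classes from the highest value guarantees that the last interval includes it, as the slide requires.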
CLASS BOUNDARY
Class boundaries (CB)   Class mark (CM)   <CF   >CF
359.5-369.5             364.5             2     30
369.5-379.5             374.5             5     28
379.5-389.5             384.5             10    25
389.5-399.5             394.5             17    20
399.5-409.5             404.5             22    13
409.5-419.5             414.5             26    8
419.5-429.5             424.5             29    4
429.5-439.5             434.5             30    1
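The two cumulative frequency columns follow directly from the class frequencies of the battery table (2, 3, 5, 7, 5, 4, 3, 1). A short sketch using the standard `itertools.accumulate`; the variable names are illustrative.

```python
from itertools import accumulate

freqs = [2, 3, 5, 7, 5, 4, 3, 1]   # class frequencies, lowest class first

# "Less than" cumulative frequency: running total from the lowest class upward.
less_than_cf = list(accumulate(freqs))

# "Greater than" cumulative frequency: running total from the highest class downward.
greater_than_cf = list(accumulate(reversed(freqs)))[::-1]

print(less_than_cf)     # [2, 5, 10, 17, 22, 26, 29, 30]
print(greater_than_cf)  # [30, 28, 25, 20, 13, 8, 4, 1]
```

The last "less than" value and the first "greater than" value should both equal the total number of observations, 30.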
CROSS-TABULATION
CROSS-TABULATION
• Cross tabulation is a tool that allows you to compare
the relationship between two variables.
• A cross-tabulation is a two (or more) dimensional
table that records the number (frequency) of
respondents that have the specific characteristics
described in the cells of the table.
• The Chi-square statistic is the primary statistic
used for testing the statistical significance of
the cross-tabulation table. Chi-square tests
whether or not the two variables are
independent.
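As a small illustration, a two-way table can be built by counting how many respondents fall into each combination of two characteristics. The respondents and the two variables (gender and car ownership) are invented for this example.

```python
from collections import Counter

# Hypothetical respondents: (gender, owns a car)
responses = [("F", "yes"), ("F", "no"), ("M", "yes"), ("M", "yes"),
             ("F", "yes"), ("M", "no"), ("F", "no"), ("M", "yes")]

cells = Counter(responses)                    # frequency of each (row, column) pair
rows = sorted({r for r, _ in responses})      # ['F', 'M']
cols = sorted({c for _, c in responses})      # ['no', 'yes']

table = [[cells[(r, c)] for c in cols] for r in rows]

print("    " + "  ".join(f"{c:>3}" for c in cols))
for r, counts in zip(rows, table):
    print(f"{r:<4}" + "  ".join(f"{n:>3}" for n in counts))
```

Each cell holds the number of respondents with that row characteristic and that column characteristic, which is exactly what a cross-tabulation records.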
CROSS TABULATION WITH CHI SQUARE
ANALYSIS
CHI SQUARE ANALYSIS
• The chi-square statistic is computed by first
computing a chi-square value for each individual
cell of the table and then summing them up to form
a total Chi-square value for the table. The chi-
square value for the cell is computed as:
(Observed Value – Expected Value)² /
(Expected Value)
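The computation can be sketched in plain Python. The expected value of a cell is taken as (row total × column total) / grand total, which is the usual expectation under independence; the slide itself does not spell this out, and the observed counts below are hypothetical.

```python
def chi_square(observed):
    """Sum (observed - expected)^2 / expected over every cell of a two-way table."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    total = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / n   # expected count if independent
            total += (o - e) ** 2 / e
    return total

observed = [[30, 10],
            [20, 40]]   # hypothetical 2 x 2 cross-tabulation

statistic = chi_square(observed)
print(round(statistic, 3))  # 16.667
```

Here the expected counts are 20, 20, 30 and 30, so the four cell values are 5, 5, 3.333 and 3.333, which sum to about 16.667.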
REMEMBER
• The chi-square statistic, along with the associated
probability of chance observation (the p-value), may be
computed for any table. If a table at least as extreme as
the one observed would occur with very low probability when
the variables are independent (say, less than 5%), then we say
that the results are “statistically significant” at the
“.05 or 5% level”. This means the data are unlikely to have
arisen if the variables were independent, so we conclude
that the variables are related.
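For a 2 × 2 table the chi-square statistic has one degree of freedom, and in that special case the probability can be computed with the standard library alone. The sketch below relies on that restriction; larger tables need a general chi-square distribution function from a statistics package such as SPSS. The function name and the sample statistics are illustrative.

```python
import math

def p_value_df1(chi2):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom.

    Valid only for 1 degree of freedom (a 2 x 2 table).
    """
    return math.erfc(math.sqrt(chi2 / 2))

ALPHA = 0.05   # the 5% significance level

for statistic in (1.0, 3.841, 16.667):
    p = p_value_df1(statistic)
    verdict = "significant" if p < ALPHA else "not significant"
    print(f"chi-square = {statistic:<7} p = {p:.4f}  {verdict}")
```

A statistic of about 3.84 sits right at the 5% level for one degree of freedom, so larger values are declared statistically significant.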
SPSS TUTORIAL FOR CROSS
TABULATION
What is SPSS?
"SPSS is a comprehensive system for analyzing data.
SPSS stands for Statistical Package for
the Social Sciences.
SPSS can take data from almost any type of file and
use them to generate tabulated reports, charts,
and plots of distributions and trends, descriptive
statistics, and complex statistical analysis."
THANK YOU AND GOD BLESS! 

Data Collection Preparation

  • 1. D A T A P R E P A R A T I O N | F R E Q U E N C Y D I S T R I B U T I O N | C R O S S - T A B U L A T I O N DATA COLLECTION PREPARATION
  • 2. DATA • Data is anything that has been produced or created during research. Primary data is data that you have created yourself, but your data sets can also contain data that has been created by other researchers.
  • 3. WHAT IS DATA COLLECTION? • It is the process of gathering and measuring information on variables of interest, in an established systematic fashion that enables one to answer stated research questions, test hypotheses, and evaluate outcomes.
  • 4. METHODS OF DATA COLLECTION • A. Interview (Direct) Method – a method of person-to-person exchange between the interviewer and the interviewee.
  • 5. METHODS OF DATA COLLECTION POSITIVE • 1) It provides consistent and more precise information since clarification maybe given by the interviewee. • 2) Questions maybe repeated or maybe modified to suit the interviewee’s level of understanding.
  • 6. METHODS OF DATA COLLECTION NEGATIVE • 1) Time-consuming • 2) Expensive • 3) Limited field coverage
  • 7. METHODS OF DATA COLLECTION • B. Questionnaire (Indirect) Method – in this method, written responses are given to prepared questions. A questionnaire is used to elicit answers to the problems of the study. Questionnaires may be mailed or hand-carried.
  • 8. METHODS OF DATA COLLECTION POSITIVE • 1) Inexpensive • 2) Can cover a wide area in a shorter span of time. • 3) Respondents may feel a greater sense of freedom to express views and opinions because their anonymity is maintained.
  • 9. METHODS OF DATA COLLECTION NEGATIVE • 1) There’s a strong possibility of non-response, especially when questionnaires are mailed. • 2) Questions not easily understood may not be answered.
  • 10. METHODS OF DATA COLLECTION • C. Registration Method – this method of gathering information is enforced by law. E.g. • Registration of births • Deaths • Vehicles • Licenses • Number of tourists in a City
  • 11. METHODS OF DATA COLLECTION POSITIVE • 1) Information is kept systematized. • 2) Information is always made available to the public.
  • 12. METHODS OF DATA COLLECTION • D. Observation Method – the investigator observes the behavior of the subject/respondent. It is used when the subjects cannot talk or write. POSITIVE The recording of behavior at the appropriate time and situation is made possible.
  • 13. METHODS OF DATA COLLECTION • E. Experiment Method – this method is used when the objective is to determine the cause-and-effect relationship of certain phenomena under controlled conditions. It is usually used by scientific researchers.
  • 15. 1. MAKE LOGISTICS ARRANGEMENTS. • In order to make logistics arrangements, you will have to (1) set up central local headquarters and (2) contact the local authorities where the survey will be carried out.
  • 16. 2. PREPARE THE QUESTIONNAIRE AND TRAINING MATERIALS. • You must pre-test the translated questionnaire in the field.
  • 17. 2. PREPARE THE QUESTIONNAIRE AND TRAINING MATERIALS. Specifically, the pre-test should answer the following questions: • Are respondents willing to answer questions in the way you have asked them? • Are any of the questions particularly difficult to answer or do they address sensitive issues? • Are the questions well understood by the respondents? • Is it necessary to create new codes for common answers which were not included in the original questionnaire?
  • 18. 3. CHOOSING AND PREPARING THE EQUIPMENT • Equipment must be purchased well in advance of the survey. Examples are weighing scales, length/height boards, etc.
  • 19. 4. QUESTIONNAIRE CHECKING AGAIN • Questionnaire checking involves eliminating unacceptable questionnaires. A questionnaire may be unacceptable because it is incomplete, its instructions were not followed, the responses show little variance, pages are missing, it was received past the cut-off date, or the respondent is not qualified.
  • 20. 5. COLLECTING DATA AND ANALYSIS • Editing: Editing looks to correct illegible, incomplete, inconsistent and ambiguous answers. • Coding: Coding typically assigns alpha or numeric codes to answers that do not already have them so that statistical techniques can be applied. • Transcribing: Transcribing data involves transferring data so as to make it accessible to people or applications for further processing.
  • 21. • Cleaning: Cleaning reviews data for consistency. Inconsistencies may arise from faulty logic or from out-of-range or extreme values. • Statistical adjustments: Statistical adjustments apply to data that require weighting and scale transformations. • Analysis strategy selection: Finally, selection of a data analysis strategy is based on earlier work in designing the research project but is finalized after consideration of the characteristics of the data that have been gathered.
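The cleaning step above can be illustrated with a small range check. This is only a sketch: the records, field names, and acceptable limits below are hypothetical and are not taken from the slides.

```python
# Hypothetical survey records; field names and limits are illustrative assumptions.
records = [
    {"id": 1, "age": 34, "satisfaction": 4},
    {"id": 2, "age": 210, "satisfaction": 5},   # age is out of range
    {"id": 3, "age": 27, "satisfaction": 9},    # rating is outside the 1-5 scale
]

# Acceptable (inclusive) limits for each variable.
valid_ranges = {"age": (0, 120), "satisfaction": (1, 5)}


def find_out_of_range(rows, ranges):
    """Return (record id, field, value) for every value outside its limits."""
    problems = []
    for row in rows:
        for field, (low, high) in ranges.items():
            value = row[field]
            if not (low <= value <= high):
                problems.append((row["id"], field, value))
    return problems


issues = find_out_of_range(records, valid_ranges)
for issue in issues:
    print(issue)
```

Flagged values would then be checked against the original questionnaire rather than corrected automatically.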
  • 23. WHAT IS DATA PREPARATION? • Data preparation is about constructing a dataset from one or more data sources to be used for exploration and modeling. It is a solid practice to start with an initial dataset to get familiar with the data, to discover first insights into the data and have a good understanding of any possible data quality issues.
  • 24. DATA PREPARATION • Organizing the data correctly can save a lot of time and prevent mistakes. • Most researchers choose to use a database or statistical analysis program (Microsoft Excel, SPSS) that they can format to fit their needs in order to organize their data effectively. • Once the data has been entered, it is crucial that the researcher check the data for accuracy.
  • 25. STEPS IN DATA PREPARATION • 1. Checking the Data For Accuracy • As soon as data is received you should screen it for accuracy. In some circumstances doing this right away will allow you to go back to the sample to clarify any problems or errors. • Are the responses legible/readable? • Are all important questions answered? • Are the responses complete? • Is all relevant contextual information included (e.g., date, time, place, researcher)?
  • 26. STEPS IN DATA PREPARATION • 2. EDITING • Editing detects errors and omissions and corrects them as far as possible. • Purpose: • To ensure accuracy • To bring about consistency with other information • To make sure the data is uniformly entered • To make sure the data is complete and arranged to simplify coding and tabulation.
  • 27. STEPS IN DATA PREPARATION • 3. ENTERING THE DATA INTO THE COMPUTER • The analyst should use a procedure called double entry, in which the data are entered twice and the two versions are compared. • This double entry procedure significantly reduces entry errors. • An alternative is to enter the data once and set up a procedure for checking the data for accuracy. • EXAMPLE: you might spot check records on a random basis. • You will use various programs to summarize the data that allow you to check that all the data are within acceptable limits and boundaries.
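Double entry, as it is commonly practised, means keying the same questionnaires twice and comparing the two versions. The comparison could be sketched as follows; the respondent IDs and answers are hypothetical, and `compare_entries` is an illustrative helper rather than part of any statistics package.

```python
# Two independent keyings of the same questionnaires (hypothetical data).
# Keys are respondent IDs; values are the answers to four items.
first_entry = {101: [3, 4, 2, 5], 102: [1, 1, 4, 2], 103: [5, 3, 3, 4]}
second_entry = {101: [3, 4, 2, 5], 102: [1, 2, 4, 2], 103: [5, 3, 3, 4]}


def compare_entries(a, b):
    """List (respondent id, item index, value in a, value in b) for each mismatch."""
    mismatches = []
    for rid in sorted(a.keys() & b.keys()):
        for index, (x, y) in enumerate(zip(a[rid], b[rid])):
            if x != y:
                mismatches.append((rid, index, x, y))
    return mismatches


mismatches = compare_entries(first_entry, second_entry)
print(mismatches)  # [(102, 1, 1, 2)]
```

Each mismatch points to a record that should be checked against the paper questionnaire.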
  • 28. STEPS IN DATA PREPARATION • 4. DATA TRANSFORMATIONS • Once the data have been entered it is almost always necessary to transform the raw data into variables that are usable in the analyses. • Missing values • Many analysis programs automatically treat blank values as missing. In others, you need to designate specific values to represent missing values. • Item reversals • On scales and surveys, we sometimes use reversal items to help reduce the possibility of a response set. When you analyze the data, you want all scores for scale items to be in the same direction where high scores mean the same thing and low scores mean the same thing. In these cases, you have to reverse the ratings for some of the scale items.
  • 29. STEPS IN DATA PREPARATION • Scale totals • Once you've transformed any individual scale items you will often want to add or average across individual items to get a total score for the scale. • Categories • For many variables you will want to collapse them into categories. For instance, you may want to collapse income estimates (in dollar amounts) into income ranges.
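The four transformations described above (missing values, item reversals, scale totals, categories) can be sketched in a few lines. All values here are assumptions made for illustration: the missing-value code `-9`, the 1 to 5 rating scale, the positions of the reversed items, and the income cut-offs.

```python
MISSING_CODE = -9            # assumed code used for "no answer" in the raw file
SCALE_MIN, SCALE_MAX = 1, 5  # assumed rating scale
REVERSED_ITEMS = {1, 3}      # assumed 0-based positions of reverse-worded items

raw = [4, 2, -9, 1, 5]       # one hypothetical respondent's five ratings

# 1. Missing values: turn the designated code into None.
ratings = [None if v == MISSING_CODE else v for v in raw]


# 2. Item reversals: on a 1-5 scale, 1 <-> 5 and 2 <-> 4, while 3 stays 3.
def reverse(value):
    return SCALE_MIN + SCALE_MAX - value


aligned = [
    reverse(v) if (i in REVERSED_ITEMS and v is not None) else v
    for i, v in enumerate(ratings)
]

# 3. Scale total: average across the items that were answered.
answered = [v for v in aligned if v is not None]
scale_mean = sum(answered) / len(answered)


# 4. Categories: collapse a dollar amount into an income range.
def income_range(income):
    if income < 25_000:
        return "under 25,000"
    if income < 50_000:
        return "25,000-49,999"
    return "50,000 and over"


print(aligned)               # [4, 4, None, 5, 5]
print(scale_mean)            # 4.5
print(income_range(30_000))  # 25,000-49,999
```

Averaging over answered items is one possible way to handle a missing rating; other projects may prefer to drop the respondent or impute a value.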
  • 31. FREQUENCY DISTRIBUTION TABLE • Frequency tells you how often something occurs. The frequency of an observation in statistics tells you the number of times the observation occurs in the data. • Frequency distribution tables can show either categorical variables (sometimes called qualitative variables) or quantitative variables (sometimes called numeric variables). You can think of categorical variables as being categories (like eye color or brand of dog food) and quantitative variables as being numbers.
  • 32. GROUPED AND UNGROUPED DATA • UNGROUPED FREQUENCY DISTRIBUTION • The data obtained in original form are called raw data or ungrouped data. • In an ungrouped frequency distribution, each distinct value is listed in order together with its frequency.
  • 33. UNGROUPED DATA • In each of 20 homes, people were asked how many cars were registered to their households. The results were recorded as follows: • 1, 2, 1, 0, 3, 4, 0, 1, 1, 1, 2, 2, 3, 2, 3, 2, 1, 4, 0, 0
      Number of cars (x)   Frequency (f)
      0                    4
      1                    6
      2                    5
      3                    3
      4                    2
      Table 1. Frequency table for the number of cars registered in each household
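The frequency table for the car data can be reproduced with a short script. This sketch uses Python's standard `collections.Counter`; the tally column is rebuilt with `|` characters because the original tally marks are not available.

```python
from collections import Counter

# Number of cars registered in each of the 20 households (from the slide).
cars = [1, 2, 1, 0, 3, 4, 0, 1, 1, 1, 2, 2, 3, 2, 3, 2, 1, 4, 0, 0]

freq = Counter(cars)

print("   x  tally   f")
for x in sorted(freq):
    print(f"{x:>4}  {'|' * freq[x]:<6} {freq[x]:>2}")

print("Total:", sum(freq.values()))  # Total: 20
```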
  • 35. GROUPED DATA • In grouped data, a moderate range of values is gathered together into a class and its frequency is compared with that of similar ranges. • CLASS FREQUENCY • The number of observations belonging to a class interval. • CLASS INTERVAL • The grouping defined by a lower limit and an upper limit. • CLASS BOUNDARIES • The lower and upper true limits. • CLASS MARK • The midpoint of each class interval, obtained by averaging the lower class limit and the upper class limit. • CLASS SIZE • The difference between the upper class boundary and the lower class boundary of a class interval.
  • 36. GROUPED DATA • Thirty AA batteries were tested to determine how long they would last. The results, to the nearest minute, were recorded as follows: • 423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392, 408, 431, 401, 363, 391, 405, 382, 400, 381, 399, 415, 428, 422, 396, 372, 410, 419, 386, 390
  • 37. GROUPED DATA • The lowest value is 363 and the highest is 431. • Using the given data and a class size of 10, the interval for the first class is 360 to 369 and includes 363 (the lowest value). Remember, there should always be enough class intervals so that the highest value is included. • Note: the ideal number of class intervals (nc) is 5 to 20.
  • 38. GROUPED DATA
      Battery life, minutes (x)   Frequency (f)
      360–369                     2
      370–379                     3
      380–389                     5
      390–399                     7
      400–409                     5
      410–419                     4
      420–429                     3
      430–439                     1
      Total                       30
      Table 3. Life of AA batteries, in minutes
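The grouped table can be checked by assigning each battery life to its class. This is a minimal sketch assuming the class size of 10 and the first lower limit of 360 used in the slides; `class_lower_limit` is an illustrative helper name.

```python
from collections import Counter

# Battery lives in minutes (from the slide).
battery_life = [
    423, 369, 387, 411, 393, 394, 371, 377, 389, 409,
    392, 408, 431, 401, 363, 391, 405, 382, 400, 381,
    399, 415, 428, 422, 396, 372, 410, 419, 386, 390,
]

CLASS_SIZE = 10    # class size used in the slides
FIRST_LOWER = 360  # lower limit of the first class


def class_lower_limit(value):
    """Lower limit of the class interval that contains value."""
    return FIRST_LOWER + ((value - FIRST_LOWER) // CLASS_SIZE) * CLASS_SIZE


grouped = Counter(class_lower_limit(v) for v in battery_life)

for lower in sorted(grouped):
    print(f"{lower}-{lower + CLASS_SIZE - 1}: {grouped[lower]}")
```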
  • 39. CLASS BOUNDARY
      Class boundaries (CB)   Class mark (CM)   <CF   >CF
      359.5–369.5             364.5              2     30
      369.5–379.5             374.5              5     28
      379.5–389.5             384.5             10     25
      389.5–399.5             394.5             17     20
      399.5–409.5             404.5             22     13
      409.5–419.5             414.5             26      8
      419.5–429.5             424.5             29      4
      429.5–439.5             434.5             30      1
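The class boundaries, class marks, and cumulative frequencies for the battery data can be derived from the class limits and frequencies. This sketch follows the definitions given earlier (class mark = average of the lower and upper class limits) and assumes data recorded to the nearest minute, so each boundary lies 0.5 beyond the limit.

```python
from itertools import accumulate

# Class limits and frequencies from the battery-life table.
classes = [(360, 369), (370, 379), (380, 389), (390, 399),
           (400, 409), (410, 419), (420, 429), (430, 439)]
frequencies = [2, 3, 5, 7, 5, 4, 3, 1]

# True limits: half a unit below the lower limit and above the upper limit.
boundaries = [(lo - 0.5, hi + 0.5) for lo, hi in classes]

# Class mark: midpoint of the lower and upper class limits.
marks = [(lo + hi) / 2 for lo, hi in classes]

# "Less than" cumulative frequency: running total from the first class.
less_than_cf = list(accumulate(frequencies))

# "Greater than" cumulative frequency: running total from the last class.
greater_than_cf = list(accumulate(reversed(frequencies)))[::-1]

for row in zip(boundaries, marks, less_than_cf, greater_than_cf):
    print(row)
```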
  • 41. CROSS-TABULATION • Cross-tabulation is a tool that allows you to compare the relationship between two variables. • A cross-tabulation is a two (or more) dimensional table that records the number (frequency) of respondents that have the specific characteristics described in the cells of the table.
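A two-way cross-tabulation is a count of respondents for each combination of two characteristics. The sketch below builds one from hypothetical answers; the variables (gender and car ownership) and the six responses are invented for illustration.

```python
from collections import Counter

# Hypothetical survey answers: (gender, owns a car) for each respondent.
responses = [
    ("female", "yes"), ("female", "no"), ("male", "yes"),
    ("male", "yes"), ("female", "yes"), ("male", "no"),
]

# Each cell of the table is the number of respondents with that combination.
cell_counts = Counter(responses)

rows = sorted({r for r, _ in responses})
cols = sorted({c for _, c in responses})

# Print the table with row labels on the left and column labels on top.
print("".ljust(8) + "".join(c.rjust(6) for c in cols))
for r in rows:
    print(r.ljust(8) + "".join(str(cell_counts[(r, c)]).rjust(6) for c in cols))
```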
  • 42. • The Chi-square statistic is the primary statistic used for testing the statistical significance of the cross-tabulation table. Chi-square tests whether or not the two variables are independent. CROSS TABULATION WITH CHI SQUARE ANALYSIS
  • 43. CHI SQUARE ANALYSIS • The chi-square statistic is computed by first computing a chi-square value for each individual cell of the table and then summing them up to form a total chi-square value for the table. The chi-square value for the cell is computed as: (Observed Value – Expected Value)² / (Expected Value)
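The cell-by-cell computation can be sketched as follows. The observed counts are hypothetical. The expected value of a cell is taken as (row total × column total) / grand total, which is the usual expected count under independence; the slides do not spell this out.

```python
# Hypothetical 2 x 2 table of observed counts (rows x columns).
observed = [
    [30, 20],
    [20, 30],
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

chi_square = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        # Expected count if the two variables were independent.
        expected = row_totals[i] * col_totals[j] / grand_total
        # Chi-square value for this cell, added to the table total.
        chi_square += (obs - expected) ** 2 / expected

print(chi_square)  # 4.0
```

Here every expected count is 25, each cell contributes (5)² / 25 = 1, and the four cells sum to 4.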
  • 44. REMEMBER • The chi-square statistic, along with the associated probability of chance observation, may be computed for any table. If a relationship as strong as the one observed would occur with very low probability (say less than 5%) when the variables are actually independent, then we say that the results are “statistically significant” at the “.05 or 5% level”. This means that the data give evidence against the variables being independent.
  • 45. SPSS TUTORIAL FOR CROSS TABULATION What is SPSS? "SPSS is a comprehensive system for analyzing data. SPSS is the acronym of Statistical Package for the Social Sciences. SPSS can take data from almost any type of file and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and complex statistical analysis."
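For a 2 × 2 table (one degree of freedom), the probability associated with a chi-square value can be computed with the standard library alone. This sketch relies on the fact that a chi-square variable with one degree of freedom is the square of a standard normal variable. It does not apply to larger tables, which need a general chi-square distribution function from a statistics package.

```python
import math


def p_value_df1(chi_square):
    """Probability of a chi-square value at least this large when the
    variables are independent, for one degree of freedom only."""
    return math.erfc(math.sqrt(chi_square / 2))


ALPHA = 0.05  # the 5% significance level mentioned in the slides

p = p_value_df1(4.0)  # 4.0 is a hypothetical chi-square value
print(round(p, 4))    # 0.0455
print("significant at the 5% level" if p < ALPHA else "not significant")
```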
  • 46. THANK YOU AND GOD BLESS! 