Successfully reported this slideshow.
Upcoming SlideShare
×

# Building Mathematical Ability Foundation Course PowerPoint Presentation-Data Collection Techniques and Population

2,274 views

Published on

PowerPoint created for Delhi University's FYUP Foundation Course-Building Mathematical Ability on the topic-
Data Collection Techniques and Population

Published in: Education, Technology
• Full Name
Comment goes here.

Are you sure you want to Yes No
• Be the first to comment

### Building Mathematical Ability Foundation Course PowerPoint Presentation-Data Collection Techniques and Population

1. 1. BUILDING MATHEMATICAL ABILITY
2. 2. PRESENTED BY: • Parth Nagpal (11332576) • Krati Goyal (11332550) • Sonali Kalra (1133260) • Arun (11332485) • Pratibha Sharma (11332559) • Akash Verma (11332529) • Rahul (11332487) • Pankaj Kumar (11332532) • Dhirendra Jaiswal (11332484) • Mridul Patel (11332490)
3. 3. DATA COLLECTION TECHNIQUES Sub Heads • Methods to collect data – questionnaire, observations, recording, etc. • Population and sampling
4. 4. DATA Data are values of quantitative or qualitative variables, belonging to a set of items. Eg: Data regarding the no. of seats for some of the science courses in a college Course No. of Seats Mathematics 120 Physics 95 Chemistry 78 Statistics 50
5. 5. TYPES OF DATA
6. 6. Quantitative data are measurements that are recorded on a naturally occurring numerical scale.
7. 7. IDENTIFICATION Measured on a numeric scale. • Number of defective items in a lot. • Salaries of CEOs of oil companies. • Ages of employees at a company. 4 943 52 21 12 120 8 71 3
8. 8. Qualitative data are measurements that cannot be measured on a natural numerical scale; they can only be classified into a group of categories.
9. 9. IDENTIFICATION Classified into categories. • College major of each student in a class. •Gender of each employee at a company. • Method of payment (cash, check, credit card).
10. 10. SOURCES OF DATA COLLECTION
11. 11. PRIMARY DATA Primary data is a type of information that is obtained directly from first-hand sources by means of surveys, observation or experimentation. It is the data that has not been previously published and is derived from a new or original research study and collected at the source such as in marketing.
12. 12. METHODS OF COLLECTING PRIMARY DATA • Interview • Questionnaire • Observations
13. 13. SECONDARY DATA Secondary data is any information collected by someone else other than it's user. It is data that has already been collected and is readily available for use. Secondary data saves on time as compared to primary data which has to be collected and analysed before use.
14. 14. METHODS OF COLLECTING SECONDARY DATA Published Sources • International Publication – UNO,WTO,etc. • Government Publications – Central and State Govt. • Publication- Municipal Corporations,boards, etc.  Unpublished Sources • Research Works • Records of Private Firms
15. 15. QUESTIONNAIRE A series of questions asked to individuals to obtain information statistically useful about a certain topic.
16. 16. ADVANTAGES • They can be easily be fed back to employees. • Questionnaires can be standard or customized. • Large amounts of information can be gathered. • Can be analyzed easily. • Simple and quick to fill up by the respondent. • Can be used for sensitive topics. • Respondents have time to think about their answers. • Format is familiar to most respondents.
17. 17. DISADVANTAGES • Respondents don‟t always answer honestly. • No additional questions can be asked. • Appear impersonal. • Respondents may misunderstand. • Unsuitable for some kind of respondents. • Respondents may ignore questions. • Different respondents may interpret differently. • Danger of questionnaire fatigue. 17
18. 18. KEY POINTS FOR MAKING AN IDEAL QUESTIONNAIRE • • • • • • Keep it as short as possible. Ask short, simple, clearly worded questions. Start with demographic questions. Use open ended questions cautiously. Place the questions in logical order. Try to minimize effort to be put in by the respondent. • Put in more open ended questions.
19. 19. INTERVIEWS • This method of collecting data involves presentation or oral-verbal stimuli and reply in terms of oral-verbal responses. • Interviews are probably the most widely used technique for collecting data. • They permit the interviewer to ask the respondent direct questions.
20. 20. TYPES OF INTERVIEW • Structured: Pre-established questions • Unstructured: Draw out information without the use of pre-established questions • Semi-Structured: A mixture of both strategies
21. 21. FORMAL AND INFORMAL INTERVIEW Formal: A formal interview is just that, formal. It includes: the office setting; the formal handshake; appropriate attire; order and structure; and best professional behavior. Informal: An informal interview attempts to ignore the rules and roles associated with interviewing in an attempt to gain trust and create a more natural environment for an open and honest communication.
22. 22. INTERVIEWING TIPS • Keep language pitched to that of respondent • Avoid long questions • Create comfort • Establish time frame for interview • Avoid leading questions • Sequence topics • Be respectful • Listen carefully
23. 23. ADVANTAGES AND DISADVANTAGES OF INTERVIEWS Advantages Disadvantages Deep and free response Costly in time and personnel Flexible, adaptable Requires skill Glimpse into respondent‟s tone, gestures May be difficult to summarize responses Ability to probe, follow-up Possible biases: interviewer, respondent, situation
24. 24. EXAMPLE Suppose IBN7 Managing Editor Ashutosh comes to KMC to interview our Principal, Dr. SP Gupta. Now, by interviewing Dr. SP Gupta, he can get data about our college like, • What is the total budget allotted by DU to KMC? • What is the the total strength of the teaching/Non-teaching staff? • How many rooms are available for the lectures? And so on……
25. 25. OBSERVATION • Observation: A systematic method of data collection that relies on a researcher‟s ability to gather data through their senses • Observe: To notice using a full range of appropriate senses. To see, hear, feel, taste, and smell
26. 26. ADVANTAGES TO OBSERVATION: • They are free from the biases inherent in the self-report data. • They put the practitioner directly in touch with the behaviors in question. • They involved real-time data, describing behavior occurring in the present rather than the past.
27. 27. PROBLEMS WITH OBSERVATION • Difficulties interpreting the meaning underlying the observations. • Observers must decide which people to observe; choose time periods, territory and events • Failure to attend to these sampling issues can result in a biased sample of data.
28. 28. EXAMPLES See what is happening • traffic patterns • land use patterns • layout of city and rural areas • quality of housing • condition of roads • conditions of buildings • who goes to a health clinic
29. 29. INFORMAL EXAMPLE During our Manali tour, I observed some things that can provide a kind of data regarding the tour. The observations are as follows: • There were 3 teachers, 9 -1st year, 2 -2nd year and 31- 3rd students. • Only 1 student among 45 demanded onion/garlic less food. • Most rooms were booked on triple sharing basis and 3-4 rooms on quad sharing basis. • Only 1 student among 45 was a minor.
30. 30. THINGS TO CONSIDER • All data collection methods are capable of gathering quantitative and qualitative data, although some may be better suited towards one task or the • There is no single data collection method that can guarantee credible data • All data collection methods can be consciously manipulated • All data collection methods can be „contaminated‟ by unrecognized bias • All data collection methods require conscious deliberation on the part of the researcher to ensure credibility
31. 31. POPULATION
32. 32. WHAT IS A POPULATION? • A population is any complete group with at least one characteristic in common. • Populations are not just people. • Populations may consist of people, animals, businesses, buildings, motor vehicles, farms, objects or events.
33. 33. WHY DO YOU NEED TO KNOW WHO OR WHAT ARE IN A POPULATION? • When looking at data, it is important to clearly identify the population being studied or referred to, so that you can understand who or what are included in the data. • For example, if you were looking at some Indian farming data, you would need to understand whether the population, the data refers to, is all farms in India, just farms that grow crops, those that only have livestock, or some other type of farm.
34. 34. SAMPLING Sampling is the process of selecting a number of individuals for a study in such a way that the individuals represent the larger group from which they were selected
35. 35. Sample The representatives selected for a study whose characteristics match the larger group from which they were selected.
36. 36. SAMPLING FRAME • A sampling frame is the source material from which a sample is drawn. • It is a list of all those within a population who can be sampled, and may include individuals, households or institutions
37. 37. SAMPLING BREAKDOWN
38. 38. BASIC METHODS OF SAMPLING Probability sampling Nonprobability sampling
39. 39. Probability Sampling Simple Random Sampling Systematic Random Sampling Stratified Random Sampling Cluster Sampling
40. 40. SIMPLE RANDOM SAMPLING • It is the basic random sampling technique where a group of subjects (a sample) is selected for study from a larger group (a population). • Every experimental unit is chosen entirely by chance and each member of the population has an equal chance of being included in the sample. Examples: Lottery, generation of random numbers/digits.
41. 41. LOTTERY SAMPLING Procedures: 1. Write down the name of each member of the population on pieces of paper. 2. Place these papers in a box or a container drum. 3. The box or lottery drum must be shaken thoroughly to prevent some pieces of paper from sinking at the bottom. 4. Picked the required number of sample units from the lottery drum.
42. 42. SYSTEMATIC SAMPLING • Select a random starting point using chits,etc. and then select every kth subject in the population • Simple to use so it is used often where n is the sample size, and N is the population size
43. 43. Example: Choosing a sample of size 84 from 500. k = N/n where N = 500 and n = 84 k = 500/84 k = 5.95 k 6
44. 44. STRATIFIED SAMPLING • The population is divided into two or more groups called strata, according to some criterion, such as geographic location, grade level, age, or income, and subsamples are randomly selected from each strata.
45. 45. CLUSTER SAMPLING Divide the population into groups (called clusters), randomly select some of the groups, and then collect data from ALL members of the selected groups Used extensively by government and private research organizations Examples: Exit Polls
46. 46. Nonprobability Sampling Convenience Sampling Judgment Sampling Quota Sampling Snowball Sampling
47. 47. CONVENIENCE SAMPLING It attempts to obtain a sample of convenient elements. Often, respondents are selected because they happen to be in the right place at the right time. Examples: Using family members or students in a classroom Mall shoppers
48. 48. JUDGMENT/PURPOSIVE SAMPLING It is a form of convenience sampling in which the population elements are selected based on the Judgment of the researcher. Examples: Test markets, engineers selected in industrial marketing research, expert witnesses used in court,etc.
49. 49. QUOTA SAMPLING Quota sampling may be viewed as two-stage restricted judgmental sampling: • The first stage consists of developing control categories, or quotas, of population elements. • In the second stage, sample elements are selected based on convenience or judgment.
50. 50. SNOWBALL SAMPLING • In snowball sampling, an initial group of respondents is selected, usually at random. • After being interviewed, these respondents are asked to identify others who belong to the target population of interest. • Subsequent respondents are selected based on the referrals.
51. 51. SAMPLING PROCESS
52. 52. 1. DEFINE POPULATION TO BE SAMPLED... Identify the group of interest and its characteristics to which the findings of the study will be generalized. Note: Mostly the “accessible” or “available” population must be used.
53. 53. 2. DETERMINE THE SAMPLE SIZE • The size of the sample influences both the representativeness of the sample and the statistical analysis of the data Note: 1.Larger samples are more likely to detect a difference between different groups. 2.Smaller samples are more likely not to be representative.
54. 54. RULES OF THUMB FOR DETERMINING THE SAMPLE SIZE • The larger the population size, the smaller the percentage of the population required to get a representative sample • For smaller samples (N ‹ 100), there is little point in sampling. Survey the entire population. • If the population size is around 500, 50% should be sampled. • If the population size is around 1500, 20% should be sampled. • Beyond a certain point (N = 5000), the population size is almost irrelevant and a sample size of 400 may be adequate.
55. 55. 3. CONTROL FOR SAMPLING BIAS AND ERROR • Be aware of the sources of sampling bias and identify how to avoid it. • Decide whether the bias is so severe that the results of the study will be seriously affected in the final report, document awareness of bias, rationale for proceeding, and potential effects
56. 56. 4. SELECT THE SAMPLE... • A process by which the researcher attempts to ensure that the sample is representative of the population from which it is to be selected. Note: Requires identifying the sampling method that will be used.
57. 57. TYPES OF ERRORS
58. 58. • In general, there are two types of errors: 1) non-sampling errors 2)sampling errors
59. 59. SAMPLING ERROR • These are errors that arise because data has been collected from a part, rather than the whole of the population. • Because of the above, sampling errors are restricted to sample surveys only unlike non-sampling errors that can occur in both sample surveys and censuses data. • There are no sampling errors in a census because the calculations are based on the entire population. • They are measurable from the sample data in the case of probability sampling.
60. 60. FACTORS AFFECTING SAMPLING ERROR It is affected by a number of factors including: Sample Size • In general, larger sample sizes decrease the sampling error, however this decrease is not directly proportional. • As a rough rule of the thumb, you need to increase the sample size. The Sampling Fraction • This is of lesser influence but as the sample size increases as a fraction of the population, the sampling error should decrease.
61. 61. CONTINUED… The Variability Within The Population. • More variable populations give rise to larger errors as the samples or the estimates calculated from different samples are more likely to have greater variation. • The effect of variability within the population can be reduced by the use of stratification. Sample Design • An efficient sampling design will help in reducing sampling error.
62. 62. NON-SAMPLING ERROR • Nonsampling errors are more serious and are due to mistakes made in the acquisition of data or due to the sample observations being selected improperly. • These are errors that arise during the course of all data collection activities. • In summary, they have the following characteristics: 1) exist in both sample surveys and censuses data. 2) difficult to measure .
63. 63. SOURCES OF NON-SAMPLING ERRORS Non-sampling errors arise from: • defects in the sampling frame. • failure to identify the target population. • non response. • responses given by respondents.
64. 64. REDUCING NON-SAMPLING ERRORS Can be minimised by adopting any of the following approaches: • using an up-to-date and accurate sampling frame. • careful selection of the time the survey is conducted. • careful questionnaire design. • providing thorough training and periodic retraining of interviewers and processing staff.
65. 65. WHAT IS A SURVEY ? • A “survey” is a systematic method of gathering information from a sample. • A survey usually originates when an individual or institution is confronted with an information need and the existing data are insufficient
66. 66. MAIL / PHONE / INTERNET SURVEYS • Literacy issues • Consider accessibility  reliability of postal service  turn-around time • Consider bias  What population segment has telephone access? Internet access?
67. 67. ADVANTAGES AND CHALLENGES OF SURVEYS Advantages Challenges Best when you want to know what people think, believe, or perceive, only they can tell you that. People may not accurately recall their behavior or may be reluctant to reveal their behavior if it is illegal. What people think they do or say they do is not always the same as what they actually do.
68. 68. SURVEY TIPS • Keep them short (under 5 minutes) • Avoid huge long checklists • Allow for text comments • Allow for categorical identifications -- school, job function, grade, etc.
69. 69. BASIC SURVEY TYPES • Surveys can be administered in a number of ways: • Face to face • Telephone • Self-administered
70. 70. RESPONSE CATEGORIES • Survey questions can either be open or closed: • Open questions: These questions ask respondents to construct answers using their own words. Open questions can generate rich and candid data, but it can be data that is difficult to code and analyse • Closed questions: These questions force respondents to choose from a range of predetermined responses, and are generally easy to code and statistically analyse .