Data Collection Methods
Pros and Cons of Primary and
Secondary Data
Where do data come
from?
   We’ve seen our data for this lab, all
    nice and collated in a database – from:
    – Insurance companies (claims,
      medications, procedures, diagnoses, etc.)
    – Firms (demographic data, productivity
      data, etc.)
Where do data come
from?
   Take a step back – if we’re starting
    from scratch, how do we collect / find
    data?
    – Secondary data
    – Primary data
Secondary Data

   Secondary data – data someone else
    has collected
    – This is what you were looking for in your
      assignment.
Secondary Data –
Examples of Sources
   County health departments
   Vital Statistics – birth, death certificates
   Hospital, clinic, school nurse records
   Private and foundation databases
   City and county governments
   Surveillance data from state government
    programs
   Federal agency statistics - Census, NIH, etc.
Secondary Data –
Limitations
   What did you find on the frustrating
    side as you looked for data on the
    state’s websites?
Secondary Data –
Limitations
   When was it collected? For how long?
    – May be out of date for what you want to
      analyze.
    – May not have been collected long enough
      for detecting trends.
    – E.g. Have new anticorruption laws
      impacted Russia’s government
      accountability ratings?
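One way to make the "when and for how long" question concrete is to count how many observations fall on each side of the event you care about. The sketch below is only an illustration: the function name `coverage_report`, the threshold of three periods per side, and the dates are all assumptions, not part of any real ratings data set.

```python
from datetime import date

def coverage_report(observation_dates, event_date, min_periods_each_side=3):
    """Summarize whether a series spans an event well enough to look for a trend change."""
    dates = sorted(observation_dates)
    before = [d for d in dates if d < event_date]
    after = [d for d in dates if d >= event_date]
    return {
        "first": dates[0],
        "last": dates[-1],
        "n_before": len(before),
        "n_after": len(after),
        # The threshold is an arbitrary illustration, not a statistical rule.
        "enough_for_trend": len(before) >= min_periods_each_side
        and len(after) >= min_periods_each_side,
    }

# Hypothetical annual ratings, one per January; the event date is made up.
ratings_dates = [date(year, 1, 1) for year in range(2005, 2013)]
report = coverage_report(ratings_dates, event_date=date(2011, 6, 1))
print(report)
```

With these made-up dates there is only one observation after the event, so the series would probably be too short to say anything about a change in trend.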
Secondary Data –
Limitations
   Is the data set complete?
    – There may be missing information on
      some observations
    – Unless such missing information is caught
      and corrected for, the analysis may be biased.
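A first check for completeness might look like the sketch below: measure how much of a variable is missing, then compare another variable between records with and without it. The records, field names, and helper functions are hypothetical, and a gap between the two group means is only a warning sign, not proof of bias.

```python
from statistics import mean

# Hypothetical claims records; None marks a missing value.
records = [
    {"age": 34, "claims": 2, "income": 52000},
    {"age": 61, "claims": 7, "income": None},
    {"age": 45, "claims": 3, "income": 61000},
    {"age": 70, "claims": 9, "income": None},
    {"age": 29, "claims": 1, "income": 48000},
]

def missing_share(rows, field):
    """Fraction of rows where the field is missing."""
    return sum(r[field] is None for r in rows) / len(rows)

def mean_by_missingness(rows, missing_field, compare_field):
    """Mean of compare_field for rows with and without missing_field.

    Assumes both groups are non-empty; statistics.mean raises on empty input.
    """
    present = [r[compare_field] for r in rows if r[missing_field] is not None]
    absent = [r[compare_field] for r in rows if r[missing_field] is None]
    return mean(present), mean(absent)

share = missing_share(records, "income")
age_present, age_absent = mean_by_missingness(records, "income", "age")
print(share, age_present, age_absent)
```

In this toy data the records missing income belong to noticeably older people, which suggests the missingness is not random and that dropping those records could skew the results.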
Secondary Data –
Limitations
   Are there confounding problems?
    – Sample selection bias?
    – Source choice bias?
    – In time series, did some observations
      drop out over time?
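For the time-series question, a minimal sketch is to compare which units appear in the first and last periods. The panel below and the function `find_dropouts` are hypothetical, and the sketch assumes each period is stored as a set of unit identifiers.

```python
def find_dropouts(panel):
    """panel maps period -> set of unit ids observed in that period.

    Returns the units seen in the first period but not the last,
    and the share of first-period units still present at the end.
    """
    periods = sorted(panel)
    first, last = panel[periods[0]], panel[periods[-1]]
    dropped = first - last
    retention = len(first & last) / len(first)
    return dropped, retention

# Hypothetical panel of four firms followed for three years.
panel = {
    2019: {"A", "B", "C", "D"},
    2020: {"A", "B", "C"},
    2021: {"A", "C"},
}
dropped, retention = find_dropouts(panel)
print(sorted(dropped), retention)
```

If the units that drop out differ systematically from those that stay, the remaining sample may no longer represent the original population.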
Secondary Data –
Limitations
   Are the data consistent/reliable?
    – Did variables drop out over time?
    – Did variables change in definition over
      time?
          E.g. number of years of education versus
           highest degree obtained.
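A change in definition sometimes shows up as a change in how a variable is coded. The sketch below flags the years in which a field switches between numeric values and text labels; the rows, field names, and helper functions are hypothetical, and real definition changes can be subtler than a change of type.

```python
def coding_by_year(rows, field):
    """Return, per year, whether a field is coded as a number or a label."""
    kinds = {}
    for row in rows:
        kind = "numeric" if isinstance(row[field], (int, float)) else "label"
        kinds.setdefault(row["year"], set()).add(kind)
    return kinds

def years_where_coding_changes(kinds):
    """Years whose coding differs from the previous observed year."""
    years = sorted(kinds)
    return [y for prev, y in zip(years, years[1:]) if kinds[prev] != kinds[y]]

# Hypothetical survey rows: education recorded as years, then as a degree.
rows = [
    {"year": 2008, "education": 12},
    {"year": 2008, "education": 16},
    {"year": 2009, "education": 14},
    {"year": 2010, "education": "bachelor's"},
    {"year": 2010, "education": "high school"},
]
kinds = coding_by_year(rows, "education")
changes = years_where_coding_changes(kinds)
print(changes)
```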
Secondary Data –
Limitations
   Is the information exactly what you need?
    – In some cases, may have to use “proxy
      variables” – variables that may approximate
      something you really wanted to measure. Are
      they reliable? Is there correlation to what you
      actually want to measure?
    – E.g. gauging student interest in U.W. by their
      ranking on FAFSA – subject to gamesmanship.
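If a small validation sample exists in which both the proxy and the quantity of interest were measured, one rough check is their correlation. The sketch below computes a Pearson correlation by hand; the numbers are invented, and a strong correlation in a validation sample does not rule out problems such as gamesmanship in the wider data.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# Hypothetical validation sample of six students.
proxy_rank = [1, 2, 3, 4, 5, 6]        # position of the school on the form, 1 = first
stated_interest = [9, 7, 8, 4, 5, 2]   # interest reported directly, 10 = highest
r = pearson_r(proxy_rank, stated_interest)
print(round(r, 2))
```

Here the correlation is strongly negative, as expected when a lower rank number means more interest; a value near zero would suggest the proxy carries little information about what you actually want to measure.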
Secondary Data –
Advantages
   No need to reinvent the wheel.
    – If someone has already found the data,
      take advantage of it.
Secondary Data –
Advantages
   It will save you money.
    – Even if you have to pay for access, it is
      often cheaper than collecting your own
      data. (More on this later.)
Secondary Data –
Advantages
   It will save you time.
    – Primary data collection is very time
      consuming. (More on this later, too!)
Secondary Data –
Advantages
   It may be very accurate.
    – Especially when a government agency
      has collected the data, incredible
      amounts of time and money went into it.
      It’s probably highly accurate.
Secondary Data –
Advantages
   It has great exploratory value
    – Exploring research questions and
      formulating hypotheses to test.
Primary Data

   Primary data – data you collect
Primary Data - Examples

   Surveys
   Focus groups
   Questionnaires
   Personal interviews
   Experiments and observational studies
Primary Data -
Limitations
   Do you have the time and money for:
    – Designing your collection instrument?
    – Selecting your population or sample?
    – Pretesting/piloting the instrument to work
      out sources of bias?
    – Administration of the instrument?
    – Entry/collation of data?
Primary Data -
Limitations
   Uniqueness
    – May not be able to compare to other
      populations
Primary Data -
Limitations
   Researcher error
    – Sample bias
    – Other confounding factors
Data collection choice

   What you must ask yourself:
    – Will the data answer my research
      question?
Data collection choice

   To answer that
    – You must first decide what your research
      question is
    – Then you need to decide what
      data/variables are needed to scientifically
      answer the question
Data collection choice

   If those data exist in secondary form,
    then use them to the extent you can,
    keeping their limitations in mind.
   But if they do not, and you are able to
    fund primary collection, then primary
    collection is the method of choice.
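The choice can be sketched as a comparison between the variables the research question needs and the variables a secondary source offers. The variable names and the function `split_by_source` below are hypothetical; the sketch only shows which variables would be left for primary collection, and it says nothing about whether the available ones are good enough.

```python
def split_by_source(required, available_secondary):
    """Split required variables into those a secondary source covers and those it lacks."""
    required = set(required)
    covered = required & set(available_secondary)
    missing = required - covered
    return sorted(covered), sorted(missing)

# Hypothetical variable lists for a workplace health study.
required = ["age", "diagnosis", "productivity", "job_satisfaction"]
available = ["age", "diagnosis", "productivity", "zip_code"]
covered, to_collect = split_by_source(required, available)
print(covered, to_collect)
```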
