Understanding ICPSR
Four “Tours” of ICPSR Research Data Services &
Education Resources
Fall 2014
What’s Included – Four Tours in One
• What, Why, & Who of “ICPSR”
– Mission and usage of ICPSR
– ICPSR’s past & present
– ...
Tour I: The What, Why, & Who of ICPSR
ICPSR’s Mission
ICPSR advances and expands social and behavioral research,
acting as a global leader in data stewardship a...
What We Do – It’s About Data!
• Seek research data and pertinent
documents from researchers (PIs,
research agencies, gover...
Why People Use ICPSR
• Write articles, papers, or theses using real
research data
• Conduct secondary research (analysis) ...
Who uses ICPSR?
- Over 40 Disciplines/Fields Supported -
• One of the world’s oldest and largest social
science data archives, est. 1962
• Data distributed on punch cards, then re...
Present Volumes of Activity
• 7,591 studies: 64,926 datasets: 177,656 files
available for download
– 1,194 restricted stud...
Most Popular Downloads this Past Year:
• National Longitudinal Study of Adolescent Health
• National Survey on Drug Use an...
Benefits of Membership in ICPSR
• Data access: 4,872 studies associated with 28,475 curated datasets including:
– General ...
Tour II: Finding Research Data for Analysis
If you recall: Most Popular Downloads this Past Year:
• National Longitudinal Study of Adolescent Health
• National Survey...
What’s in a “Download?”
• Documentation files - pdfs
– Questionnaire
– Codebook
– Description & Citation
• Data in many fo...
How does One Download Data?
The MyData Account
• MyData account – operates as authentication and like a
shopping cart!
• A...
Enter Our Front Office: ICPSR Website
http://www.icpsr.umich.edu/
The Challenge – Hoards of Data & Metadata
How does one make sense of:
• 7,600 studies
• 65,000 datasets
• 177,700 files
• ...
Search Strategies to Find & Analyze Data
ICPSR’s Thematic Data Collections
– another search strategy
• ICPSR’s Thematic Collections are
archives organized around
s...
The Study Home Page:
Where Documentation Lives!
It’s really a searchable database
• Containing over 64,600 citations
of known published and
unpublished works resulting
fr...
Data Tools: Social Science Variables Database
Enables ICPSR users to:
• Search & Compare Variables across
datasets
• Assis...
Analyze & Compare Variables
Data Tools: Online “On-Demand” Analysis
“View SDA Studies”
SDA Output
Supporting the Data
• Free user support
• The Get Help Page offers:
– User support (at ICPSR) email and phone contact
info...
Tour III: ICPSR in Education
ICPSR Summer Program in Quantitative
Methods
• Instruction on the tools and practices needed to analyze data
• For those w...
Teaching Resources to
Bring Data Into the Classroom
• Easy to use features of ICPSR’s website in classes
– Social Science ...
Bring Data Into the Classroom
Data-Driven Learning Guides – over 50 stand-alone
exercises that teach social & behavioral science
concepts via standardiz...
Crosstab Assignment Builder – a utility to build simple
tables for analysis that the instructor can share with
students
Online Modules
Student Internships & Research Opportunities
• Paid Student
Internships focusing
on investigating social
& behavioral scie...
Tour IV: Sustainable Data
Management & Curation
First - The Concept of “Data Curation”
• Curation, from the Latin "to care," is the process used to add value to
data, max...
Two ‘Recent’ Moments in Federal Data
Sharing History
• NSF: January 2011 – requirement of data
management plans
• OSTP: Fe...
The details are still developing but the
focus for research data sharing includes:
1. Maximize public access (includes dis...
What is good data sharing?
The goals are simple:
• Data gets used (maximizes taxpayer
investment & credits investigators)
...
ICPSR offers Three Sustainable Data
Sharing Models to Fulfill Requirements
• Fee-for-access model (membership archive)
• A...
ICPSR’s Fee-for-Access Data Sharing
• Funding is maintained by annual membership (subscription) fees
charged to institutio...
ICPSR’S Agency-funded Data Sharing
• Agency sponsors/funds (ongoing) data curation & sharing enabling the
public to access...
ICPSR’s Fee-for-Deposit Data Sharing
- openICPSR -
• Depositor (individual or entity) pays for data
to be curated and stor...
Data Management & Curation Resources
http://www.icpsr.umich.edu/datamanagement/
Purpose of Data Management Plans
• Data management plans describe how researchers
will provide for long-term preservation ...
Data Management
Plan Resources
Guidelines for Download
And still more guidelines after the
project is awarded:
• Guide emphasizes
preparation for data
sharing throughout
the pro...
Sharing Restricted-Use Data
• Data with disclosure risk –
potential to identify a research
subject
• Data with highly sens...
Common Objection/Misperception:
“My data are too sensitive to share. . .”
• ICPSR has been sharing restricted-use data for...
Reality: Restricted-use data can be
effectively shared with the public
• Through the use of a virtual data enclave where
t...
The Visual
For More Information on ICPSR:
• Explore the website - www.icpsr.umich.edu
• Sign up for our email announcements -
www.icp...
Upcoming SlideShare
Loading in...5
×

Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Educational Resources

573

Published on

This is ICPSR's core workshop deck designed to introduce, remind, and refresh your knowledge of ICPSR. It contains four "tours" or sub-presentations describing ICPSR's general reason for being, it's social and behavioral research data complete with search strategies, its training, educational, and instructional resources, and its data management and curation services, data repository options, and support resources (content and budget estimates) for those writing grant proposals.

Published in: Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
573
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
7
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • For this slide, we tend to conduct live demos, but a few notes here to get you started:

    This is the front page of the data search. The search box, Find Data, works much like other search engines. Note however that unfortunately, this search does not accommodate (or correct) for misspellings. Not receiving any results? Check your spelling. If using names, names like Bob and will not bring up Robert or Timothy. Correct name references must be input.

    The page also offers several pre-programmed ways to obtain results – by topic, by geography, or by studies that have learning guides (teaching resources) associated with the study.
  • Link to the Thematic Collections page: http://www.icpsr.umich.edu/icpsrweb/content/membership/partners/archives.html
  • The Study Home Page is also a great “search” strategy. Click into any study, and you will find all the information we have been able to gather about the study.

    Use the Summary for a quick review, then click into the “view details” to understand the full scope of the research – methodology, survey type, sampling, scope, geography, subject terms used to tag the dataset, PI, and much more. You’ll also find a link to all of the journal articles, reports, and presentations we’ve been able to link to the dataset (where the data was used as part of the analysis within the article). This is a great way to understand whether this data is for you.
  • What’s in the bibliography collection? Published & unpublished works . . .
    using data in the ICPSR holdings as the primary data source
    using ICPSR data in a comparison with the primary dataset investigated
    "about" an ICPSR dataset or study series
    The link to Find Publications is found from the Find & Analyze Data page or directly here: http://www.icpsr.umich.edu/icpsrweb/ICPSR/citations/index.jsp
  • Tool for teaching
    Research Methods:
    Concept operationalization
    Effect of question wording, context, and answer categories on variable distributions
    Substantive classes:
    Cultural / social changes reflected in different question wordings, or elicited answers (longitudinal or time series data)
    Current content:
    Over 76 percent of ICPSR holdings
    Approx. 4 million variables
    Continues to grow by including
    All new releases, if suitable
    Retrofits as made available by small-scale projects
  • ICPSR is working with Berkeley to render SDA in a format that will allow us to customize the appearance of the interface and results.

    As of August 2014: 943 studies available online.

    View SDA studies here: http://www.icpsr.umich.edu/icpsrweb/ICPSR/access/sda.jsp
  • Gender variable by occasions smoked marijuana variable
  • As you seen, ICPSR doesn’t just deliver data. We surround that data with tools and services that support its use and interpretation.

  • Instructional materials are another way to “share” research data – in addition to educating the next generation.
  • These are ICPSR’s “Thematic Collections”
  • openICPSR is a unique public data-sharing service:
    Where the deposit is reviewed by professional data curators who are experts in developing metadata (tags) for the social and behavioral sciences = discoverable
    With an immediate distribution network of over 750 institutions looking for research data, that has powerful search tools, and a data catalog indexed by major search engines = usage
    Sustained by a respected organization with over 50 years of experience in reliably protecting research data = sustainable
    Prepared to accept and disseminate sensitive and/or restricted-use data in the public-access environment = protection of research subjects
  • A collection of resources (links) to assist in data management plans for grant proposals
    Tools to prepare plans (templates & sample plans)
    Contact information for plan advice
  • 22 pages of guidelines and references even including a sample plan (boilerplate!) available for download.

    Link to pdf document: http://www.icpsr.umich.edu/files/datamanagement/DataManagementPlans-All.pdf
  • Pdf link to the data prep guide: http://www.icpsr.umich.edu/files/deposit/dataprep.pdf

    More information on data preparation for archiving: http://www.icpsr.umich.edu/icpsrweb/content/deposit/guide/
  • Sensitive personal information isn’t about names, addresses, credit card numbers, or other direct identifying information. Research scientists should never, never, ever submit this type of information to any hosted service – ever. What we’re talking about is highly personal information (topics) within research data that may include past/present drug use, illegal activities, or perhaps sexual habits.
  • We’re currently adding about 50 new contracts each month.
  • We are in the development phase of technology for disseminating video research data!
  • Understanding ICPSR - An Orientation and Tours of ICPSR Data Services and Educational Resources

    1. 1. Understanding ICPSR Four “Tours” of ICPSR Research Data Services & Education Resources Fall 2014
    2. 2. What’s Included – Four Tours in One • What, Why, & Who of “ICPSR” – Mission and usage of ICPSR – ICPSR’s past & present – Benefits of membership • Finding Research Data for Analysis – Scope & search strategies – Data tools • ICPSR in Education – ICPSR Summer Program – Teaching resources – Student internships & research opportunities • Sustainable Data Management & Curation – Fulfilling grant requirements – Deposit and curation options and resources – Sharing restricted-use data
    3. 3. Tour I: The What, Why, & Who of ICPSR
    4. 4. ICPSR’s Mission ICPSR advances and expands social and behavioral research, acting as a global leader in data stewardship and providing rich data resources and responsive educational opportunities for present and future generations. Three Pillars for Implementing our Mission 1. Share data – maximize access to research data for analysis and publications 2. Educate and train current & future research methodologists & data scientists 3. Provide data management & curation services to fulfill grant requirements and assure long-term viability of research data
    5. 5. What We Do – It’s About Data! • Seek research data and pertinent documents from researchers (PIs, research agencies, government) • Process, describe (tag), and preserve the data and documents • Disseminate (share) data • Provide education, training, & instructional resources • Offer grant-writing and fulfillment support and data management services
    6. 6. Why People Use ICPSR • Write articles, papers, or theses using real research data • Conduct secondary research (analysis) to support findings of current research or to generate new findings • Study or teach quantitative methods (data analysis techniques) • Study data curation and repository management • Use as intro material in grant proposals • Preserve/disseminate primary research data – Fulfill data management plan (grant) and data sharing requirements
    7. 7. Who uses ICPSR? - Over 40 Disciplines/Fields Supported -
    8. 8. • One of the world’s oldest and largest social science data archives, est. 1962 • Data distributed on punch cards, then reel- to-reel tape, now: – Data available on demand – Over 8,870 studies with over 65,000 data sets • Membership organization among 22 universities, now: – Currently about 750 members world-wide – Federal funding of public-access collections ICPSR’s Past & Present
    9. 9. Present Volumes of Activity • 7,591 studies: 64,926 datasets: 177,656 files available for download – 1,194 restricted studies (6,359 datasets) • FY 2014 – 683,204 datasets downloaded – 38,924 active MyData accounts – 457,449 website visits/300,198 unique visitors – 1,040+ Summer Program attendees
    10. 10. Most Popular Downloads this Past Year: • National Longitudinal Study of Adolescent Health • National Survey on Drug Use and Health • General Social Surveys (1972-2012 Cumulative) • National Survey of Midlife Development in the US (MIDUS) • Children of Immigrants Longitudinal Study (CILS) • Chinese Household Income Project • Drug Abuse Warning Network (DAWN) • India Human Development Survey • National Prisoner Statistics • National Health and Social Life Survey • Health Behavior in School-Aged Children • American National Election Study • Education Longitudinal Study (ELS)
    11. 11. Benefits of Membership in ICPSR • Data access: 4,872 studies associated with 28,475 curated datasets including: – General Social Survey – American National Election Survey – Education Longitudinal Survey – New Family Structures Study • Teaching resources (Data-Driven Learning Guides) available exclusively to ICPSR members • Discounted ICPSR Summer Program tuition • Discounts on deposit fees related to openICPSR – ICPSR’s public data access collection • Menu of data usage reports across your institution immediately available electronically • Data management plan and budget estimate support for grant proposals • Access to a global network of over 750 institutions of all sizes interested in research data, data curation, and training
    12. 12. Tour II: Finding Research Data for Analysis
    13. 13. If you recall: Most Popular Downloads this Past Year: • National Longitudinal Study of Adolescent Health • National Survey on Drug Use and Health • General Social Surveys (1972-2012 Cumulative) • National Survey of Midlife Development in the US (MIDUS) • Children of Immigrants Longitudinal Study (CILS) • Chinese Household Income Project • Drug Abuse Warning Network (DAWN) • India Human Development Survey • National Prisoner Statistics • National Health and Social Life Survey • Health Behavior in School-Aged Children • American National Election Study • Education Longitudinal Study (ELS)
    14. 14. What’s in a “Download?” • Documentation files - pdfs – Questionnaire – Codebook – Description & Citation • Data in many forms! – SPSS, SAS, Stata – ASCII
    15. 15. How does One Download Data? The MyData Account • MyData account – operates as authentication and like a shopping cart! • Authenticate once every six months on campus and you can carry it with you
    16. 16. Enter Our Front Office: ICPSR Website http://www.icpsr.umich.edu/
    17. 17. The Challenge – Hoards of Data & Metadata How does one make sense of: • 7,600 studies • 65,000 datasets • 177,700 files • Millions of variables • 64,600 bibliographic citations
    18. 18. Search Strategies to Find & Analyze Data
    19. 19. ICPSR’s Thematic Data Collections – another search strategy • ICPSR’s Thematic Collections are archives organized around specific topics • Most collections are funded by government agencies or foundations and therefore data are open to the public • Data from all collections, including the membership archive, are searchable by using the search found on ICPSR’s Find & Analyze page • Those desiring to search for data only within a particular collection should use the search provided within that collection
    20. 20. The Study Home Page: Where Documentation Lives!
    21. 21. It’s really a searchable database • Containing over 64,600 citations of known published and unpublished works resulting from analyses of data archived at ICPSR • That can generate study bibliographies associating each study with the literature about it • Included in one integrated search on the ICPSR website Data Tools: Find Publications The Bibliography of Data-related Literature
    22. 22. Data Tools: Social Science Variables Database Enables ICPSR users to: • Search & Compare Variables across datasets • Assists in: – Data discovery – Comparison/harmonization projects – Data harvesting & data analysis – Question mining for designing new research – Research methods & substantive courses instruction
    23. 23. Analyze & Compare Variables
    24. 24. Data Tools: Online “On-Demand” Analysis “View SDA Studies”
    25. 25. SDA Output
    26. 26. Supporting the Data • Free user support • The Get Help Page offers: – User support (at ICPSR) email and phone contact information – Data User Help Center: Short Tutorials & Webinars available 24/7 (via ICPSR’s YouTube channel) – Local Support: Who to contact at your local institution – Glossary of Terms – Social Networks: Where you can find us on YouTube, Facebook, Twitter, LinkedIn, Slideshare, and more
    27. 27. Tour III: ICPSR in Education
    28. 28. ICPSR Summer Program in Quantitative Methods • Instruction on the tools and practices needed to analyze data • For those with math phobia and those with advanced analysis skills • 3-5 day workshops and 4-8 week courses • Primarily held in Ann Arbor, MI, on the campus of The University of Michigan, but some courses on other campuses also • http://www.icpsr.umich.edu/sumprog/
    29. 29. Teaching Resources to Bring Data Into the Classroom • Easy to use features of ICPSR’s website in classes – Social Science Variables Database – Bibliography of Data-Related Literature – SDA – Online Analysis • Additionally, in partnership with teaching faculty, ICPSR has developed: – Short Exercises – the DDLGs – Online teaching modules – Online tutorials
    30. 30. Bring Data Into the Classroom
    31. 31. Data-Driven Learning Guides – over 50 stand-alone exercises that teach social & behavioral science concepts via standardized, ready-to-go, online analysis
    32. 32. Crosstab Assignment Builder – a utility to build simple tables for analysis that the instructor can share with students
    33. 33. Online Modules
    34. 34. Student Internships & Research Opportunities • Paid Student Internships focusing on investigating social & behavioral sciences research – an REU • Research paper competitions -- a research journal experience & cash prizes!
    35. 35. Tour IV: Sustainable Data Management & Curation
    36. 36. First - The Concept of “Data Curation” • Curation, from the Latin "to care," is the process used to add value to data, maximize access, and ensure long-term preservation • Data curation is akin to work performed by an art or museum curator. – Data are organized, described, cleaned, enhanced, and preserved for public use, much like the work done on paintings or rare books to make the works accessible to the public now and in the future • Curation provides meaningful and enduring access to data • Data curation is the foundation for effective, long-term data sharing
    37. 37. Two ‘Recent’ Moments in Federal Data Sharing History • NSF: January 2011 – requirement of data management plans • OSTP: February 2013 – Memo with subject “Increasing Access to the Results of Federally Funded Scientific Research”
    38. 38. The details are still developing but the focus for research data sharing includes: 1. Maximize public access (includes discoverability) 2. Protect confidentiality and privacy 3. Allow for inclusion of costs in proposals for federal funding of scientific research 4. Appropriate evaluation of submitted data plans 5. Compliance mechanisms 6. Cooperation with the private sector 7. Appropriate attribution 8. Long term preservation and sustainability
    39. 39. What is good data sharing? The goals are simple: • Data gets used (maximizes taxpayer investment & credits investigators) • Available today and into the future • Research respondent protection
    40. 40. ICPSR offers Three Sustainable Data Sharing Models to Fulfill Requirements • Fee-for-access model (membership archive) • Agency model (agency or foundation funds public access) • Fee-for-deposit model (researcher writes fee into grant and pays at deposit to fund public access)
    41. 41. ICPSR’s Fee-for-Access Data Sharing • Funding is maintained by annual membership (subscription) fees charged to institutions; individuals at member institutions have free (open) access to data • Pooled (ongoing) fees are used to acquire, curate, and maintain the service • Datasets can be acquired by non-members for a fee
    42. 42. ICPSR’S Agency-funded Data Sharing • Agency sponsors/funds (ongoing) data curation & sharing enabling the public to access without charge • The archive is hosted by ICPSR where the public can easily discover and access data and restricted-use data can also be securely shared • Agency directs data selection and compliance policies
    43. 43. ICPSR’s Fee-for-Deposit Data Sharing - openICPSR - • Depositor (individual or entity) pays for data to be curated and stored – a fee at deposit • Deposit fees to be written into the grant application • Incoming deposit fees sustain the service and the professionals behind it • Deposits are bit-level to-date, but fully curated deposits are encouraged and welcomed!
    44. 44. Data Management & Curation Resources http://www.icpsr.umich.edu/datamanagement/
    45. 45. Purpose of Data Management Plans • Data management plans describe how researchers will provide for long-term preservation of, and access to, scientific data in digital formats. • Data management plans provide opportunities for researchers to manage and curate their data more actively from project inception to completion.
    46. 46. Data Management Plan Resources
    47. 47. Guidelines for Download
    48. 48. And still more guidelines after the project is awarded: • Guide emphasizes preparation for data sharing throughout the project • Available online and via download (pdf)
    49. 49. Sharing Restricted-Use Data • Data with disclosure risk – potential to identify a research subject • Data with highly sensitive personal information What is Restricted-Use Data?
    50. 50. Common Objection/Misperception: “My data are too sensitive to share. . .” • ICPSR has been sharing restricted-use data for over a decade via three methods: – Secure Download – Virtual Data Enclave – Physical Enclave • ICPSR stores & shares over 6,400 restricted- use datasets associated with over 2,000 ‘active’ restricted-use data contracts
    51. 51. Reality: Restricted-use data can be effectively shared with the public • Through the use of a virtual data enclave where the data never leave the server • Where there is a process (and understanding!) to garner IRB approval from the requesting scientist’s university • Where there is a system, technology, data professionals, and collaboration space in place to disseminate (expensive to build!) • Because federal agencies do allow for an incremental charge to the data requestor to offset marginal costs
    52. 52. The Visual
    53. 53. For More Information on ICPSR: • Explore the website - www.icpsr.umich.edu • Sign up for our email announcements - www.icpsr.umich.edu/icpsrweb/membership/lists/index.jsp • “Like” ICPSR on Facebook/follow ICPSR on Twitter • Attend or view our webinars (open to the public!) • Find our presentations on www.slideshare.net – user: icpsr • Contact user support – netmail@icpsr.umich.edu
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×