BADM 298: DR. MOTHKOVICH
Let’s Find Some Datasets!
Prof. O’Connor
ASSIGNMENT
•Locate dataset
•800-5,000 rows, 10-20 parameters, (more is ok)
•5-7 of the parameters must be numeric
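A quick way to test a candidate file against these minimums is sketched below. It is only an illustration, not part of the assignment: the function names and the sample data are hypothetical, and it assumes the dataset has already been saved as CSV text with a header row. It treats "parameters" as columns and only checks the lower limits, since more is OK.

```python
import csv
import io

def is_numeric(value):
    """Return True if the text parses as a number (commas allowed, e.g. 30,000)."""
    try:
        float(value.replace(",", ""))
        return True
    except ValueError:
        return False

def check_requirements(csv_text, min_rows=800, min_cols=10, min_numeric=5):
    """Count rows, columns, and numeric columns in CSV text (hypothetical helper)."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    numeric_cols = 0
    for i in range(len(header)):
        # Ignore blank cells; a column counts as numeric only if every filled cell is a number.
        values = [r[i] for r in data if i < len(r) and r[i].strip() != ""]
        if values and all(is_numeric(v) for v in values):
            numeric_cols += 1
    return {
        "rows": len(data),
        "columns": len(header),
        "numeric_columns": numeric_cols,
        "meets_minimums": (len(data) >= min_rows
                           and len(header) >= min_cols
                           and numeric_cols >= min_numeric),
    }

# Tiny made-up sample, far too small for the real assignment.
sample = 'city,year,pop\nAiken,2020,"30,000"\nAugusta,2021,202000\n'
report = check_requirements(sample)
```

With the default minimums the tiny sample fails, as expected; passing smaller minimums shows the counting logic on its own.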
ASSIGNMENT PT.2
•You will need to find information about your
topic
•Online
•Library databases & news sources
Check out the Articles tab in the BADM 298/299
course guide
ASSIGNMENT PART 3
•Write an 8-10 page paper (single spaced)
• No more than 3 charts, graphs, or figures in the paper; the rest
must be moved to the appendix
• Cover sheet, table of contents, appendix, and references are
not considered part of the 8-10 page paper.
CITATION
You will cite your sources in MLA or APA, depending on
your topic
MLA: Literature, Languages, Cultural Studies, & Social
Sciences
APA: Education, Psychology, Sciences, & Medicine
• Cite a Dataset
• Citation Guide
DATASET RESEARCH
You will need to locate
1. Description of the nature of the data
4. An overview of the Data Collection
Methodology
Don’t forget to…
• Summarize the information in the beginning of your paper
• & Cite it!
TOPIC/EXPLORATION
Start Your Search With…
•General Topic/Area of Interest
•Not all topics work well for large datasets. Pick
a couple of topics you might be interested in
as back-up.
Examples of Large Data Set Friendly Topics
• Homelessness
• Wildfire
• Climate Change
• Carbon Footprint
• Education
• Social Media
• Music
• Public Safety & Crime
• COVID-19
• Electricity
• Sexually Transmitted
Illness
• Mental Illness &
Disabilities
ORIGINAL ANALYSIS VS. REVIEW
You Are Doing Original Analysis Using Pre-Existing Data
ORIGINAL ANALYSIS
• Examines existing data
• Creates something new
• Adds to the academic
conversation
REVIEW
• Summarizes what is
already known about the
data
• No original analysis
• Does not answer new,
previously unanswered
questions about the data
EXCEL
Yes
File extensions: .CSV, .XLSX, .XLSM, .XLSB, .ODS
File Conversion Required or Add-ins
File extensions: .RDF, .JSON, .XML, .XLAM, .HTML
Don’t Use
File extensions: .PDF, .JPEG, all image files, text files
• ZIP folder: contains multiple files and often many file
types
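The three groups above can be written as a small lookup, sketched here as an illustration. The function name and category labels are hypothetical, and the extra image and text extensions (.jpg, .png, .gif, .txt) are assumed examples of "all image files" and "text files"; the slide itself names only .PDF and .JPEG.

```python
from pathlib import Path

# Extension groups taken from the slide (lowercased for matching).
EXCEL_READY = {".csv", ".xlsx", ".xlsm", ".xlsb", ".ods"}
NEEDS_CONVERSION = {".rdf", ".json", ".xml", ".xlam", ".html"}
# .jpg, .png, .gif, and .txt are assumed examples, not listed on the slide.
AVOID = {".pdf", ".jpeg", ".jpg", ".png", ".gif", ".txt"}

def classify(filename):
    """Sort a file name into one of the slide's groups by its extension."""
    ext = Path(filename).suffix.lower()
    if ext in EXCEL_READY:
        return "opens in Excel"
    if ext in NEEDS_CONVERSION:
        return "needs conversion or an add-in"
    if ext in AVOID:
        return "don't use"
    if ext == ".zip":
        return "unzip and check the files inside"
    return "unknown"

labels = [classify(name) for name in ("crime_2020.CSV", "fires.json", "report.pdf", "bundle.zip")]
```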
FILE SIZES
•Check the file size, if possible, before
downloading
•Less than 100 KB: not usually worth checking
•1,200 KB or larger is promising
•100,000 KB or larger is fine; you can use a
subset of the data if necessary
•A megabyte (MB) is one thousand times bigger than
a kilobyte (KB). A file measured in megabytes
is typically a good sign.
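The size guidelines can be expressed as a short rule of thumb, sketched below. Several things here are assumptions rather than part of the slide: the function names and wording of the labels are hypothetical, 1 KB is taken as 1,000 bytes (some systems report 1,024), the "100,000" guideline is read as 100,000 KB, and the slide gives no advice for files between 100 KB and 1,200 KB, so that range is labeled as unclear.

```python
def size_in_kb(size_bytes):
    """Convert bytes to kilobytes, assuming 1 KB = 1,000 bytes."""
    return size_bytes / 1000

def rate_file_size(size_bytes):
    """Apply the slide's size guidelines to a file size given in bytes."""
    kb = size_in_kb(size_bytes)
    if kb < 100:
        return "probably too small"
    if kb < 1200:
        # The slide does not cover this range; open the file and count rows.
        return "unclear: open it and count rows"
    if kb < 100000:
        return "promising"
    return "large: plan to use a subset"

ratings = [rate_file_size(n) for n in (50_000, 500_000, 2_500_000, 150_000_000)]
```

For a file already on disk, `os.path.getsize(path)` returns its size in bytes and could be passed straight to `rate_file_size`.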
Google Dataset Search
• Go to Google Dataset Search
GOOGLE DATASET SEARCH:
Best Sources For Datasets
• data.world
• Kaggle.com
• worldbank.org
• dataverse.harvard.edu
• healthdata.gov
GOOGLE DATASET SEARCH:
Select tabular and free!
Search a specific website
using site: followed by the last part of the URL
EXAMPLE: site:kaggle.com
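As an illustration of the site: tip, the sketch below builds a search string from a topic and a web address. The function name is hypothetical, and the topic "wildfire" is only an example drawn from the topic list earlier in the deck; the result is text to paste into the search box, nothing more.

```python
from urllib.parse import urlparse

def site_query(topic, url):
    """Build a 'topic site:domain' search string from a topic and a URL or bare domain."""
    # urlparse only fills netloc when the address includes a scheme such as https://
    host = urlparse(url).netloc or url.split("/")[0]
    if host.startswith("www."):
        host = host[4:]
    return f"{topic} site:{host}"

query = site_query("wildfire", "https://www.kaggle.com/datasets")
```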
Data Manipulation Tools (online websites)
• Requires the user to make a series of choices about
years, geography, or special aspects of the data.
•Removes the user from the raw data. The end result is a graph,
map, or other visual, although sometimes a small
raw dataset is created.
•Avoid data manipulation tools for this assignment!
One exception would be for those interested in using
the Census Data Explorer. Please set up a Book a
Librarian appointment with me if interested.
Example of data manipulation tool:
DON’T USE!
DATA.GOV
Retrieve an Excel file
HEALTH DATA.GOV
healthdata.gov
Download dataset from healthdata.gov
DATASET QUALITY
Authorship
• Who published the data? Is it a reliable source?
• When possible, get the dataset from the original source
Completeness
• Avoid datasets with lots of blanks, NA, X, etc.
• Avoid datasets that are missing critical information such as author/source, methodology, data
dictionary, etc.
Relevance
• Is the information current?
• Can you write an 8-10 page paper about the topic of your dataset?
• How many of the parameters are usable?
• At least 9 usable columns/5 numeric
• How much cleanup will I need to do before I can use this data?
• Lots of blank columns
• Columns without labels
• Dataset is too large (for example, 80 columns by 200,000 rows)
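The completeness check can be roughed out in code, as in the sketch below. It is an illustration under stated assumptions: the function name and sample data are hypothetical, the dataset is assumed to be CSV text with a header row, and the set of "missing" markers (blank, NA, N/A, X, null) is a guess that should be adjusted to whatever the dataset's own documentation uses.

```python
import csv
import io

# Markers treated as missing; adjust to match the dataset's data dictionary.
MISSING = {"", "na", "n/a", "x", "null"}

def missing_share(csv_text):
    """Return, for each column, the fraction of rows whose cell is blank or a missing marker."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = {name: 0 for name in (reader.fieldnames or [])}
    total = 0
    for row in reader:
        total += 1
        for name in counts:
            # Short rows come back as None for the missing cells.
            value = (row[name] or "").strip().lower()
            if value in MISSING:
                counts[name] += 1
    return {name: (counts[name] / total if total else 0.0) for name in counts}

# Made-up sample: 'pop' is missing in two of three rows, 'notes' is always blank.
sample = "city,pop,notes\nAiken,30000,\nAugusta,NA,\nColumbia,X,\n"
shares = missing_share(sample)
```

Columns with a share near 1.0 are candidates to drop; a dataset where most columns score high is probably one to avoid.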
CREATING A SUBSET
• Too many rows?
• Try limiting by year, date, geography, or topic
• Too many columns?
• Are there some categories you can eliminate because you are less
interested in analyzing them/discussing them in your paper?
• Eliminate rows and columns that aren’t helpful for analysis such
as…
• Empty rows and columns
• Columns that you don’t understand (check the dataset index and
description for information!)
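The subset steps above can be sketched in code as well. This is one possible approach, not the required method (the same result can be reached by filtering and deleting columns in Excel). The function name, column names, and sample data are hypothetical, and the dataset is assumed to be CSV text with a header row.

```python
import csv
import io

def make_subset(csv_text, column, keep_values, drop_columns=()):
    """Keep rows whose `column` value is in keep_values; drop unwanted and all-blank columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [r for r in reader if r[column] in keep_values]
    # Remove columns the user chose to eliminate.
    fields = [f for f in reader.fieldnames if f not in drop_columns]
    # Remove columns that are empty in every remaining row.
    fields = [f for f in fields if any((r[f] or "").strip() for r in rows)]
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fields, extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()

# Made-up sample: limit to one year, drop 'state', and let the empty column fall away.
sample = "year,state,fires,blank\n2019,SC,10,\n2020,SC,12,\n2020,GA,20,\n"
subset = make_subset(sample, "year", {"2020"}, drop_columns=("state",))
```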
HOW DO I LOCATE INFORMATION ABOUT
MY DATASET?
•Locate
• Abstract
• Methodology
• and/or documents included with the data
• Look for URLs; these may take you to the original
content or the creator’s page
WHAT IF A WRITE-UP IS NOT INCLUDED WITH
THE DATASET?
• Search for Identifying Information
• Authorship (person, agency, organization)
• See if a URL to more information is included with data
CREATING A SUBSET OF YOUR DATA
(datasets that are too large or need some clean-up)
INFORMATION SOURCES
• Browse the open web; .gov, .edu, and .org sites are generally
considered more reliable
• Academic Search Premier is our biggest general database
• Use the “Articles tab” in the Business 298/299 Guide
Go to the subject guides on the library’s home page and select
the guide that relates to your topic
Example: Education Subject Guide, Psychology Subject
Guide, etc.
Contact Me For Assistance
• Book a Librarian appointment: you must have a UofSC Aiken
email to book an appointment. This is a one-on-one virtual
appointment for help with locating raw datasets or data
citation.
• Email: susano@usca.edu
• Phone: 803-641-3261
• In-person appointments are also available

Library Instruction for BADM 298
