Making the most of your data:
Duke Data & GIS Services
Angela Zoss
Data Visualization Coordinator
November 20, 2013

http://library.duke.edu/data
Data & GIS Services
•  Perkins 226 computing cluster
•  Walk-in consultations
•  Data collections
•  Workshops
•  Online instructional materials
Brandaleone Family Center for
Data and GIS Services
•  Perkins 226
•  Open whenever the library
is open
•  12 high-powered Dell
workstations
•  3 Bloomberg financial
workstations
•  Various data analysis, GIS,
and visualization software
packages available
http://library.duke.edu/data/about/lab.html
Walk-in Consulting

…or by appointment:
askdata@duke.edu
http://library.duke.edu/data/about/schedule.html
Workshops
•  Typically toward the beginning of the semester
•  1-2 hours, often hands-on
•  Various topics, including Data Processing/Statistical
Software Packages, GIS/Mapping, Visualization
http://library.duke.edu/data/news
For announcements, sign up for our listserv:
https://lists.duke.edu/sympa/subscribe/dgs-announce
Or follow our blog and Twitter accounts:
http://blogs.library.duke.edu/data/, @duke_data, @duke_vis
SUPPORT AREAS
Support Area: Finding Data
•  Browse our curated collections list at:
http://library.duke.edu/data/collections/
(includes data sets specially licensed for Duke)
•  Email askdata@duke.edu and we’ll do some research
for you!
•  Guides for special types of data
–  Census

http://guides.library.duke.edu/content.php?pid=32369

–  Housing

http://library.duke.edu/data/courses/housing_foreclosures.html

–  Comtrade

http://guides.library.duke.edu/comtrade
Support Area: Storing Data
•  Newest support area
•  Storage & Compute Environments Comparison:
http://goo.gl/fGflCB
Support Area: Managing Data
•  Data Management Guide:

http://library.duke.edu/data/guides/data-management/index.html

•  DMP Tool @ Duke:
http://blogs.library.duke.edu/data/2013/07/12/data-managementplanning-advice-dmptool-duke/
Support Area: Analyzing Data
•  Available data analysis software packages:
–  OpenRefine (workshop also offered)
–  R (workshop also offered)
–  Stata (workshop also offered)
–  SAS
–  SPSS

•  Additional workshop on text analysis techniques
•  Walk-in consulting for real-time support
http://library.duke.edu/data/guides/
Support Area: Visualizing Data

http://guides.library.duke.edu/datavis/
Support Area: Visualizing Data
•  GIS (Geographic Information Systems) support
–  Workshops on ArcGIS and other online mapping tools
–  High-powered computers with GIS software
–  Expert help from Data & GIS Staff

•  Visualization support, more broadly
–  Workshops on Tableau Public and best practices for
charts, graphs, posters, etc.
DATA TIPS
See especially:
https://github.com/veltman/learninglunches/tree/master/datahygiene
Data Tips: Make a Copy
•  Never process the only copy of your data
•  Keep backups – onsite and offsite
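A minimal sketch of the "never process the only copy" tip, assuming the raw data lives in a single file. The function names and the `working` folder are hypothetical, chosen only for illustration. It copies the raw file and checks that the copy is byte-identical before any processing starts.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_working_copy(raw_file: Path, work_dir: Path) -> Path:
    """Copy raw_file into work_dir and confirm the copy matches the original."""
    work_dir.mkdir(parents=True, exist_ok=True)
    working_file = work_dir / raw_file.name
    shutil.copy2(raw_file, working_file)  # copy2 also preserves timestamps
    if sha256_of(raw_file) != sha256_of(working_file):
        raise IOError(f"Copy of {raw_file} does not match the original")
    return working_file
```

Calling `make_working_copy(Path("survey.csv"), Path("working"))` would leave the original untouched and return the path to process instead. An offsite backup still has to be arranged separately.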
Data Tips: Take Note
•  Keep track of the processing steps you’ve taken
•  If possible, use tools that have commands or scripts
•  Document the reasons behind your decisions
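One way to keep track of steps and reasons, sketched under the assumption that a plain text log next to the data is acceptable. The function `log_step` and the one-JSON-object-per-line format are illustrative choices, not something the slides prescribe.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def log_step(log_file: Path, step: str, reason: str) -> dict:
    """Append one processing step, with the reason for it, to a log file."""
    entry = {
        "time": datetime.now(timezone.utc).isoformat(),
        "step": step,
        "reason": reason,
    }
    # Append mode keeps earlier entries, so the log reads as a history.
    with log_file.open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(entry) + "\n")
    return entry
```

A call such as `log_step(Path("processing_log.jsonl"), "dropped rows with blank id", "ids are required for the merge")` records both what was done and why.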
Data Tips: Check Your Sources
•  Garbage in, garbage out
•  Level of analysis
Data Tips: Easy Come, Easy Go
•  Be wary of Excel

–  Volatile (easy to accidentally introduce errors)
–  Deceptively opaque
–  Limited validation capabilities
–  Better for simple data structures

•  Tips

–  Save often, multiple copies
–  Document processing steps (manually if necessary)
–  Introduce some validation checks (formulas,
visualizations)
–  Switch to database?
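If the spreadsheet can be exported to CSV, validation checks can also be scripted. The sketch below assumes hypothetical columns named `id` and `age` and an illustrative age range of 0 to 120; real checks would depend on the data set.

```python
import csv
import io


def find_problems(rows):
    """Return (line_number, message) pairs for rows that fail simple checks."""
    problems = []
    seen_ids = set()
    for line_number, row in enumerate(rows, start=2):  # line 1 is the header
        record_id = (row.get("id") or "").strip()
        if not record_id:
            problems.append((line_number, "missing id"))
        elif record_id in seen_ids:
            problems.append((line_number, f"duplicate id {record_id}"))
        else:
            seen_ids.add(record_id)
        try:
            age = int(row.get("age") or "")
        except ValueError:
            problems.append((line_number, "age is not a whole number"))
            continue
        if not 0 <= age <= 120:
            problems.append((line_number, f"age {age} out of range"))
    return problems


# Small in-memory example; a real file would be opened with open(..., newline="").
sample = "id,age\n1,34\n1,29\n,41\n4,abc\n5,150\n"
problems = find_problems(csv.DictReader(io.StringIO(sample)))
```

Here `problems` lists one issue each for lines 3 to 6 of the sample: a duplicate id, a missing id, a non-numeric age, and an out-of-range age.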
Visualization Tips
In a nutshell:
Simplify (but not the axis)
•  Reduce color
•  Focus on major trends
•  Consistent style/format/reference system
•  Do extra calculations so the users won’t have to
http://guides.library.duke.edu/topten
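As a small illustration of doing extra calculations for the audience, raw counts can be converted to percentage shares before charting. The function name and the rounding to one decimal place are assumptions made for this sketch.

```python
def percent_shares(counts: dict[str, int]) -> dict[str, float]:
    """Convert raw counts to percentage shares, rounded to one decimal place."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("counts sum to zero; shares are undefined")
    return {label: round(100 * value / total, 1) for label, value in counts.items()}
```

Labeling a chart with these shares saves readers from dividing each count by the total themselves.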
SHAMELESS PLUG
2014 Student Data Visualization Contest
http://bit.ly/viscontest14 | http://bit.ly/visgallery
•  Open to all Duke students
•  Submit any type of
visualization
•  Deadline January 19
•  2014 Awards:

–  Amazon gift cards (up to $250)
–  public exhibits
–  posters
QUESTIONS? SUGGESTIONS?
angela.zoss@duke.edu
http://twitter.com/duke_vis
