SlideShare a Scribd company logo
1 of 20
© 2016© 2016
A Practical Approach to Analyzing
Healthcare Data
Chapter 3 – Tools for Data
Organization, Analysis, and
Presentation
© 2016
Data Organization Using
Databases
• Healthcare data is complex and often multi-
dimensional
– Provider
– Patients
– Insurance companies
– Services
• Providing an organizational structure for the data can
facilitate more efficient analysis and reporting
• Database – self-describing collection of integrated
records.
– Self-describing – contains a description of its own structure
– Integrated – data elements are related to each other
© 2016
Database Vocabulary
• Tables- two dimensional arrays of data
– Rows = records
– Columns = variables or attributes
• RDMS – Relational Database Management System
– Software that is used to hold and maintain data tables and
their relationships
• SQL – Structured Query Language
– Programming language used to communicate with a
relational database
• ERD – Entity Relationship Diagram
– Diagram that shows how tables in an RDMS relate
© 2016
Small-Scale Databases
• Microsoft Access
– Relational database management system (RDMS)
– Strengths
• Approachable for inexperienced database users
• Graphical user interface (GUI)
• Built in wizards for table, query and report design
• Supports integrated forms for data collection
– Weaknesses
• Multi-user support is difficult to implement
• Database size limited to 2GB
• Most ‘classic’ Database Administrators (DBAs) do not
consider Access to be a full blown RDMS
© 2016
Hierarchy of a Relational Database
• Tables are rows and columns of values
– Envision a tab in a spreadsheet
• Fields are the columns in a spreadsheet
– In a patient database, fields may be age, gender, admission date, etc.
• Data elements or records are the rows in a spreadsheet
– In a patient database, row may represent patients or services provided to patients
• A unique row identifier in a table is called the primary key
– Cannot be duplicated within the same table
– Used to link tables together
© 2016
Entity Relationship Diagram
• Diagram that displays the relationships between tables within a RDBMS
• The key fields are identified as well as the cardinality of the relationship between the
tables
• Cardinality
– One – to – one
– One – to – many
• In Figure A.1
– PatientID is the primary key in the Patient Info table and a secondary key in the Visits table
– The cardinality of the relationship between the table is one to many.
• Each patient may have many visits
© 2016
More About Cardinality
© 2016
Data Dictionary
• Details roadmap of the database
• Should include
– Name of computer or software program that
contains the data element
– Type of data in the field
– Length of data in the field
– Edits placed on the data field
– Values allowed to be placed in the data field
– A clear definition of each value
© 2016
Structured Query Language
• SQL
• Tool to use and maintain databases
– Select data
– Update data
– Insert rows into a table
– Delete rows from a table
© 2016
SQL Example
• Retrieve the records for all patients
from Milwaukee
– SELECT PATIENT_LNAME,
PATIENT_FNAME FROM PATIENT
WHERE PATIENT_CITY = ‘Milwaukee’
• Key words in the query are in red font
© 2016
Statistical Software Packages
• Statistical Package for the Social Sciences
(SPSS)
– Menu driven program
– More suited for smaller (<1G) datasets
– May open Excel files for analysis
• Statistical Analysis System (SAS)
– Command line program
– Excellent for manipulating large datasets
© 2016
SAS Syntax
• SAS is a programming language much like
SQL
– Key words:
• Data – used to name and create a dataset
• Proc – declare which analytic procedure will be used
• Set – declare which dataset will be the subject of the
analysis
• Run – designates the end of the command and starts
the calculation
– Syntax: always end commands with a ‘;’
© 2016
Role of Excel – Data Analysis
• Data analysis
– Data analysis toolpak add-in
– Must be careful importing text data elements
that appear to be numeric
• ICD-9 codes with leading zeroes
• Variables with mixed number and alpha values
• Pivot tables are also an excellent
exploratory data analysis tool
© 2016
Graphical Displays of Data
• Types of graphical comparisons
– Group summary
– Trends or changes over time
– Relative size of groups
– Relationships between variables
© 2016
Bar Graph or Chart
• Group summary
• Comparison of counts or averages across
groups
• Two variables: admissions, age category.
– One bar for each gender
© 2016
Line Graphs or Chart
• Trends or changes over time
• Look for trends/patterns
• Should not be used for connecting unrelated points
© 2016
Pie Chart
• Compares relative size of groups
• Used to represent relative proportions of a total
• Note that this is different than a bar chart – in a pie
chart categories must be part of a bigger set or
population
© 2016
Scatter Diagrams
• Used to display the relationship between two
continuous variables
• Should not be used if either variable is categorical
© 2016
Infographics
• Conveys a message or story using a
combination of graphs and text
• Primary types:
– Cause and effect
– Chronological
– Quantitative
– Directional
– Product
© 2016
Tables versus Graphs
• Tables have several advantages over graphs such
as:
– Present more information than a graph
– Display the exact values
– Require less work to create
• Graphs also have advantages over tables such as:
– Catch the attention of the reader
– Show trends easily
– Bring out facts or relationships that stimulate thinking

More Related Content

What's hot

RES814 U1 Individual Project
RES814 U1 Individual ProjectRES814 U1 Individual Project
RES814 U1 Individual Project
ThienSi Le
 
Data processing and analysis final
Data processing and analysis finalData processing and analysis final
Data processing and analysis final
Akul10
 

What's hot (19)

PSPP overview and Introduction to R & R Commander
PSPP overview and Introduction to R & R CommanderPSPP overview and Introduction to R & R Commander
PSPP overview and Introduction to R & R Commander
 
A07 - Best Practices for Asset Reporting (MCU)
A07 - Best Practices for Asset Reporting (MCU)A07 - Best Practices for Asset Reporting (MCU)
A07 - Best Practices for Asset Reporting (MCU)
 
Introduction to ms access database
Introduction to ms access databaseIntroduction to ms access database
Introduction to ms access database
 
RES814 U1 Individual Project
RES814 U1 Individual ProjectRES814 U1 Individual Project
RES814 U1 Individual Project
 
Using drill down within alma analytics reports
Using drill down within alma analytics reportsUsing drill down within alma analytics reports
Using drill down within alma analytics reports
 
The path to flexible loading of patron records
The path to flexible loading of patron recordsThe path to flexible loading of patron records
The path to flexible loading of patron records
 
Discovery layer decisions, configurations and strategies
Discovery layer decisions, configurations and strategiesDiscovery layer decisions, configurations and strategies
Discovery layer decisions, configurations and strategies
 
Data Dictionary
Data DictionaryData Dictionary
Data Dictionary
 
Data preprocessing in Machine learning
Data preprocessing in Machine learning Data preprocessing in Machine learning
Data preprocessing in Machine learning
 
Se 381 - lec 21 - 23 - 12 may09 - df-ds and data dictionary
Se 381 - lec 21 - 23 - 12 may09 - df-ds and data dictionarySe 381 - lec 21 - 23 - 12 may09 - df-ds and data dictionary
Se 381 - lec 21 - 23 - 12 may09 - df-ds and data dictionary
 
Data processing and analysis final
Data processing and analysis finalData processing and analysis final
Data processing and analysis final
 
Overview of the preparation of tables in the new CountrySTAT platform
Overview of the preparation of tables in the new CountrySTAT platformOverview of the preparation of tables in the new CountrySTAT platform
Overview of the preparation of tables in the new CountrySTAT platform
 
Presentation1
Presentation1Presentation1
Presentation1
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
 
Data Dictionary in System Analysis and Design
Data Dictionary in System Analysis and DesignData Dictionary in System Analysis and Design
Data Dictionary in System Analysis and Design
 
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
Measure Twice and Cut Once: How a Budget Cut Impacted Subscription Renewals f...
 
Deposit data to data centre: ADP case
Deposit data to data centre: ADP caseDeposit data to data centre: ADP case
Deposit data to data centre: ADP case
 
Abap data dictionary
Abap data dictionaryAbap data dictionary
Abap data dictionary
 
Introduction to sas
Introduction to sasIntroduction to sas
Introduction to sas
 

Similar to Hm306 week 2

ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptxICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
Amanda783100
 

Similar to Hm306 week 2 (20)

Hm306 week 1 ppt A
Hm306 week 1 ppt AHm306 week 1 ppt A
Hm306 week 1 ppt A
 
Hm306 week 1 ppt 1
Hm306 week 1 ppt 1Hm306 week 1 ppt 1
Hm306 week 1 ppt 1
 
Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
 
Introduction to Database System-WEEK2.pptx
Introduction to Database System-WEEK2.pptxIntroduction to Database System-WEEK2.pptx
Introduction to Database System-WEEK2.pptx
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
 
Data Visualization & Analytics.pptx
Data Visualization & Analytics.pptxData Visualization & Analytics.pptx
Data Visualization & Analytics.pptx
 
Application of excel and spss programme in statistical
Application of excel and spss programme in statisticalApplication of excel and spss programme in statistical
Application of excel and spss programme in statistical
 
data_science_introduction_for beginners.pptx
data_science_introduction_for beginners.pptxdata_science_introduction_for beginners.pptx
data_science_introduction_for beginners.pptx
 
designing reports
designing reportsdesigning reports
designing reports
 
Data base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somyaData base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somya
 
Spss by vijay ambast
Spss by vijay ambastSpss by vijay ambast
Spss by vijay ambast
 
Database Management Systems.pptx
Database Management Systems.pptxDatabase Management Systems.pptx
Database Management Systems.pptx
 
Dw-dm-part-03
Dw-dm-part-03Dw-dm-part-03
Dw-dm-part-03
 
Uses of SPSS and Excel to analyze data
Uses of SPSS and Excel   to analyze dataUses of SPSS and Excel   to analyze data
Uses of SPSS and Excel to analyze data
 
Lec 3.pptx
Lec 3.pptxLec 3.pptx
Lec 3.pptx
 
Application of Excel and SPSS software for statistical analysis- Biostatistic...
Application of Excel and SPSS software for statistical analysis- Biostatistic...Application of Excel and SPSS software for statistical analysis- Biostatistic...
Application of Excel and SPSS software for statistical analysis- Biostatistic...
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata Harmonisation
 
SEM 8 BIOSTATISTICS graphs minitab excel etc
SEM 8 BIOSTATISTICS graphs minitab excel etcSEM 8 BIOSTATISTICS graphs minitab excel etc
SEM 8 BIOSTATISTICS graphs minitab excel etc
 
SPSS.pptx(unit 4).pptx
SPSS.pptx(unit 4).pptxSPSS.pptx(unit 4).pptx
SPSS.pptx(unit 4).pptx
 
ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptxICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
ICT NOTESmbjbujbhbuhuhhipv;ihsjhis 7.pptx
 

More from BealCollegeOnline (20)

BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressuresBA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
BA650 Week 3 Chapter 3 "Why Change? contemporary drivers and pressures
 
BIO420 Chapter 25
BIO420 Chapter 25BIO420 Chapter 25
BIO420 Chapter 25
 
BIO420 Chapter 24
BIO420 Chapter 24BIO420 Chapter 24
BIO420 Chapter 24
 
BIO420 Chapter 23
BIO420 Chapter 23BIO420 Chapter 23
BIO420 Chapter 23
 
BIO420 Chapter 20
BIO420 Chapter 20BIO420 Chapter 20
BIO420 Chapter 20
 
BIO420 Chapter 18
BIO420 Chapter 18BIO420 Chapter 18
BIO420 Chapter 18
 
BIO420 Chapter 17
BIO420 Chapter 17BIO420 Chapter 17
BIO420 Chapter 17
 
BIO420 Chapter 16
BIO420 Chapter 16BIO420 Chapter 16
BIO420 Chapter 16
 
BIO420 Chapter 13
BIO420 Chapter 13BIO420 Chapter 13
BIO420 Chapter 13
 
BIO420 Chapter 12
BIO420 Chapter 12BIO420 Chapter 12
BIO420 Chapter 12
 
BIO420 Chapter 09
BIO420 Chapter 09BIO420 Chapter 09
BIO420 Chapter 09
 
BIO420 Chapter 08
BIO420 Chapter 08BIO420 Chapter 08
BIO420 Chapter 08
 
BIO420 Chapter 06
BIO420 Chapter 06BIO420 Chapter 06
BIO420 Chapter 06
 
BIO420 Chapter 05
BIO420 Chapter 05BIO420 Chapter 05
BIO420 Chapter 05
 
BIO420 Chapter 04
BIO420 Chapter 04BIO420 Chapter 04
BIO420 Chapter 04
 
BIO420 Chapter 03
BIO420 Chapter 03BIO420 Chapter 03
BIO420 Chapter 03
 
BIO420 Chapter 01
BIO420 Chapter 01BIO420 Chapter 01
BIO420 Chapter 01
 
BA350 Katz esb 6e_chap018_ppt
BA350 Katz esb 6e_chap018_pptBA350 Katz esb 6e_chap018_ppt
BA350 Katz esb 6e_chap018_ppt
 
BA350 Katz esb 6e_chap017_ppt
BA350 Katz esb 6e_chap017_pptBA350 Katz esb 6e_chap017_ppt
BA350 Katz esb 6e_chap017_ppt
 
BA350 Katz esb 6e_chap016_ppt
BA350 Katz esb 6e_chap016_pptBA350 Katz esb 6e_chap016_ppt
BA350 Katz esb 6e_chap016_ppt
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

Hm306 week 2

  • 1. © 2016© 2016 A Practical Approach to Analyzing Healthcare Data Chapter 3 – Tools for Data Organization, Analysis, and Presentation
  • 2. © 2016 Data Organization Using Databases • Healthcare data is complex and often multi- dimensional – Provider – Patients – Insurance companies – Services • Providing an organizational structure for the data can facilitate more efficient analysis and reporting • Database – self-describing collection of integrated records. – Self-describing – contains a description of its own structure – Integrated – data elements are related to each other
  • 3. © 2016 Database Vocabulary • Tables- two dimensional arrays of data – Rows = records – Columns = variables or attributes • RDMS – Relational Database Management System – Software that is used to hold and maintain data tables and their relationships • SQL – Structured Query Language – Programming language used to communicate with a relational database • ERD – Entity Relationship Diagram – Diagram that shows how tables in an RDMS relate
  • 4. © 2016 Small-Scale Databases • Microsoft Access – Relational database management system (RDMS) – Strengths • Approachable for inexperienced database users • Graphical user interface (GUI) • Built in wizards for table, query and report design • Supports integrated forms for data collection – Weaknesses • Multi-user support is difficult to implement • Database size limited to 2GB • Most ‘classic’ Database Administrators (DBAs) do not consider Access to be a full blown RDMS
  • 5. © 2016 Hierarchy of a Relational Database • Tables are rows and columns of values – Envision a tab in a spreadsheet • Fields are the columns in a spreadsheet – In a patient database, fields may be age, gender, admission date, etc. • Data elements or records are the rows in a spreadsheet – In a patient database, row may represent patients or services provided to patients • A unique row identifier in a table is called the primary key – Cannot be duplicated within the same table – Used to link tables together
  • 6. © 2016 Entity Relationship Diagram • Diagram that displays the relationships between tables within a RDBMS • The key fields are identified as well as the cardinality of the relationship between the tables • Cardinality – One – to – one – One – to – many • In Figure A.1 – PatientID is the primary key in the Patient Info table and a secondary key in the Visits table – The cardinality of the relationship between the table is one to many. • Each patient may have many visits
  • 7. © 2016 More About Cardinality
  • 8. © 2016 Data Dictionary • Details roadmap of the database • Should include – Name of computer or software program that contains the data element – Type of data in the field – Length of data in the field – Edits placed on the data field – Values allowed to be placed in the data field – A clear definition of each value
  • 9. © 2016 Structured Query Language • SQL • Tool to use and maintain databases – Select data – Update data – Insert rows into a table – Delete rows from a table
  • 10. © 2016 SQL Example • Retrieve the records for all patients from Milwaukee – SELECT PATIENT_LNAME, PATIENT_FNAME FROM PATIENT WHERE PATIENT_CITY = ‘Milwaukee’ • Key words in the query are in red font
  • 11. © 2016 Statistical Software Packages • Statistical Package for the Social Sciences (SPSS) – Menu driven program – More suited for smaller (<1G) datasets – May open Excel files for analysis • Statistical Analysis System (SAS) – Command line program – Excellent for manipulating large datasets
  • 12. © 2016 SAS Syntax • SAS is a programming language much like SQL – Key words: • Data – used to name and create a dataset • Proc – declare which analytic procedure will be used • Set – declare which dataset will be the subject of the analysis • Run – designates the end of the command and starts the calculation – Syntax: always end commands with a ‘;’
  • 13. © 2016 Role of Excel – Data Analysis • Data analysis – Data analysis toolpak add-in – Must be careful importing text data elements that appear to be numeric • ICD-9 codes with leading zeroes • Variables with mixed number and alpha values • Pivot tables are also an excellent exploratory data analysis tool
  • 14. © 2016 Graphical Displays of Data • Types of graphical comparisons – Group summary – Trends or changes over time – Relative size of groups – Relationships between variables
  • 15. © 2016 Bar Graph or Chart • Group summary • Comparison of counts or averages across groups • Two variables: admissions, age category. – One bar for each gender
  • 16. © 2016 Line Graphs or Chart • Trends or changes over time • Look for trends/patterns • Should not be used for connecting unrelated points
  • 17. © 2016 Pie Chart • Compares relative size of groups • Used to represent relative proportions of a total • Note that this is different than a bar chart – in a pie chart categories must be part of a bigger set or population
  • 18. © 2016 Scatter Diagrams • Used to display the relationship between two continuous variables • Should not be used if either variable is categorical
  • 19. © 2016 Infographics • Conveys a message or story using a combination of graphs and text • Primary types: – Cause and effect – Chronological – Quantitative – Directional – Product
  • 20. © 2016 Tables versus Graphs • Tables have several advantages over graphs such as: – Present more information than a graph – Display the exact values – Require less work to create • Graphs also have advantages over tables such as: – Catch the attention of the reader – Show trends easily – Bring out facts or relationships that stimulate thinking