University of Maryland University College
STAT200 - Assignment #1: Descriptive Statistics Data Analysis Plan
Identifying Information
Student (Full Name):
Class:
Instructor:
Date:
Scenario: Please write a few lines describing your scenario and the four variables (in addition to income) you have selected.
Use Table 1 to report the variables selected for this assignment. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 1. Variables Selected for the Analysis
Variable Name in the Data Set
Description
(See the data dictionary for describing the variables.)
Type of Variable
(Qualitative or Quantitative)
Variable 1: “Income”
Annual household income in USD.
Quantitative
Variable 2:
Variable 3:
Variable 4:
Variable 5:
Reason(s) for Selecting the Variables and Expected Outcome(s):
1. 1. Variable 1: “Income” -
2. 2. Variable 2: “ “ -
3. 3. Variable 3: “ “ -
4. 4. Variable 4: “ “ -
5. 5. Variable 5: “ “ -
Data Set Description:
Proposed Data Analysis:
Measures of Central Tendency and Dispersion
Complete Table 2. Numerical Summaries of the Selected Variables and briefly explain why you choose those measurements. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 2. Numerical Summaries of the Selected Variables
Variable Name
Measures of Central Tendency and Dispersion
Rationale for Why Appropriate
Variable 1:
“Income”
· ● Number of Observations
· ● Median
· ● Sample Standard Deviation
I am using median for two reasons:
1. 1. If there are any outliers or the data is not normally distributed, the median is the best measure of central tendency.
2. 2. The variable is quantitative.
I am using sample standard deviation for three reasons:
1. 1. The data is a sample from a larger data set.
2. 2. It is the most commonly used measure of dispersion.
3. 3. The variable is quantitative.
Variable 2:
Variable 3:
Variable 4:
Variable 5:
Graphs and/or Tables
Complete Table 3. Type of Graphs and/or Table for Selected Variables and briefly explain why you choose those graphs and/or tables. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 3. Type of Graphs and/or Tables for Selected Variables
Variable Name
Graph and/or Table
Rationale for why Appropriate?
Variable 1:
“Income”
Graph: I will use the histogram to show the normal distribution of data.
Histogram is one of the best plot to show the normal distribution of quantitative level data .
Variable 2:
Variable 3:
Variable 4:
Variable 5:
ITS 833 – INFORMATION GOVERNANCE
Chapter 7
Business Considerations for a Successful IG Program
1
1
CHAPTER GOALS AND OBJECTIVES
Understand the differe ...
University of Maryland University CollegeSTAT200 - Assignment #1.docx
1. University of Maryland University College
STAT200 - Assignment #1: Descriptive Statistics Data Analysis
Plan
Identifying Information
Student (Full Name):
Class:
Instructor:
Date:
Scenario: Please write a few lines describing your scenario and
the four variables (in addition to income) you have selected.
Use Table 1 to report the variables selected for this assignment.
Note: The information for the required variable, “Income,” has
already been completed and can be used as a guide for
completing information on the remaining variables.
Table 1. Variables Selected for the Analysis
Variable Name in the Data Set
Description
(See the data dictionary for describing the variables.)
Type of Variable
(Qualitative or Quantitative)
Variable 1: “Income”
Annual household income in USD.
Quantitative
Variable 2:
Variable 3:
2. Variable 4:
Variable 5:
Reason(s) for Selecting the Variables and Expected
Outcome(s):
1. 1. Variable 1: “Income” -
2. 2. Variable 2: “ “ -
3. 3. Variable 3: “ “ -
4. 4. Variable 4: “ “ -
5. 5. Variable 5: “ “ -
Data Set Description:
Proposed Data Analysis:
Measures of Central Tendency and Dispersion
Complete Table 2. Numerical Summaries of the Selected
Variables and briefly explain why you choose those
measurements. Note: The information for the required variable,
“Income,” has already been completed and can be used as a
guide for completing information on the remaining variables.
Table 2. Numerical Summaries of the Selected Variables
Variable Name
Measures of Central Tendency and Dispersion
Rationale for Why Appropriate
Variable 1:
“Income”
· ● Number of Observations
· ● Median
· ● Sample Standard Deviation
3. I am using median for two reasons:
1. 1. If there are any outliers or the data is not normally
distributed, the median is the best measure of central tendency.
2. 2. The variable is quantitative.
I am using sample standard deviation for three reasons:
1. 1. The data is a sample from a larger data set.
2. 2. It is the most commonly used measure of dispersion.
3. 3. The variable is quantitative.
Variable 2:
Variable 3:
Variable 4:
Variable 5:
Graphs and/or Tables
Complete Table 3. Type of Graphs and/or Table for Selected
Variables and briefly explain why you choose those graphs
and/or tables. Note: The information for the required variable,
“Income,” has already been completed and can be used as a
guide for completing information on the remaining variables.
Table 3. Type of Graphs and/or Tables for Selected Variables
Variable Name
Graph and/or Table
Rationale for why Appropriate?
Variable 1:
4. “Income”
Graph: I will use the histogram to show the normal distribution
of data.
Histogram is one of the best plot to show the normal
distribution of quantitative level data .
Variable 2:
Variable 3:
Variable 4:
Variable 5:
ITS 833 – INFORMATION GOVERNANCE
Chapter 7
Business Considerations for a Successful IG Program
1
1
5. CHAPTER GOALS AND OBJECTIVES
Understand the difference between structured and unstructured
data
Understand the difference between unstructured and semi-
structured information
Challenges of unstructured data
Understand the concept of full cost accounting (FCA)
10 Key factors that drive the total cost of ownership (TCO) of
unstructured data
Managing information
Understand how an IG-enabled organization is different from
one that is not IG-enabled
2
2
The Business Case for Information Governance
Difficult to justify
Non-existent short term return on investment (ROI)
Long-term view is essential
Reduces exposure to risk over time
Improves the quality and security of information
Streamlines the retention of information
Allows us to look at information costs in a different light
3
Information Environment
Challenges of Unstructured Information
Unstructured information (email, word documents, social
media) is growing faster than the ability to effectively manage
6. it and necessitates the use of different approaches than those
used to manage structured databases
Amount of data growing 44 times existing size over next 10
years while unstructured information growing 50 percent over
the rate of structured data.
Challenges unique to unstructured information include:
Horizontal nature – across the enterprise
Lack of formality
Management location
Ownership issues
Classification
4
Information Environment
Calculating Information Costs
Rising storage costs (short-sighted thinking)
Labor (particularly knowledge workers)
Overhead costs
“Dark information”
Costs of e-discovery and litigation
Opportunity costs
5
Full Cost Accounting for Information Models
Total Cost of Ownership (TCO) Model
Return on Investment (ROI)
Full Cost Accounting Model (FCA)
Past, Present, Future Costs
Direct Costs
Indirect Costs
Flexible Application
Triple Bottom Line Accounting – Monetary, Environment,
7. Societal Costs
Full Cost Accounting (FCA)
General and Administrative costs
Productivity Gains and Losses
Legal and E-discovery costs
Indirect Costs
Up-Front Costs
Future Costs
6
7
8
9
8. The End
[email protected] Geanie Assante 2019
12
12
ITS 833 – INFORMATION GOVERNANCE
Chapter 6
Information Governance Policy Development
1
1
CHAPTER GOALS AND OBJECTIVES
Know the 8 Generally Accepted Recordkeeping Principles®
What is the IG Reference Model?
What does the IGRM Diagram consist of?
What are the best practice considerations?
What is the benefits and risks of having standards?
What are the key standards relevant to IG
2
2
A Review of the 8 Generally Accepted
10. Outer Ring: Complex set of interoperable processes and
implementing he procedures and structural element to put them
into practice
Requirements:
Understanding of business imperatives
Knowledge of appropriate tools and infrastructure
Sensitivity to legal and regulatory obligations
Inner Ring: Depicts a work-flow (life-cycle) diagram. Shows
that information management is important at all stages of the
lifecycle
5
5
How the IGRM Diagram related to the
Generally Accepted Recordkeeping Principles®
Support the ARMA Principle by identifying the cross-functional
groups of IG stakeholders
Depicts the intersecting objectives of the organization
Depicts the relationship duty, value and information assets
Used by proactive organizations as an introspective lens to
facilitate visualization, understanding and discussion
concerning how to apple the “Principles” to the organization.
Puts focus on the “Principles”
Provides essential context for the maturity model
6
6
Considerations in IG Policy Formation
11. Best Practices?
YES!
Understand that Best Practices will vary per organization
Review 25 generic Best Practices, Pages 75 and 76 of text book
7
Standards?
YES!
Two types to consider
De Jure Standards - Legal standards published by standards
setting bodies such as IOS, ANSI, NIST, BTS and others
De Facto Standards – Informal standards regarded by many as
actual standards – arising through popular use (Example:
Windows in the business world in 2001-2010). May be
published by formal standards setting bodies without having
“Formal” status
7
Benefits and Risks of Standards
Benefits
Quality Assurance Support
Interoperability Support
Implementation Framework and Certification Checklists
Cost Reduction
International Consensus
8
Risks
Possible Decreased Flexibility
Standards Confusion
Real-World Shortcomings to Due Theoretical Basis
Cost and Maintenance Involving in Updating Standard
12. 8
KEY STANDARDS RELEVANT TO IG
Risk Management
ISO 31000-2009 – States principles and generic guidelines of
risk management applicable to IG
Provides a structured framework for development and
implementation of risk management strategies and programs
“Risk Management Framework”: Set of two basic components
(foundations and organizational arrangements) that support and
sustain risk management throughout the organization.
9
9
KEY STANDARDS RELEVANT TO IG
Information Security Management
ISO/IEC 27001:2005- Information Security Management System
Standard that provides guidance in development of security
controls for protection of information assets
Flexible –can be applied to different activities and processes
Includes use of standards by auditors and stakeholders
ISO/IEC 27002:2005-Information Technology-Security
Techniques-Code of Practice for Information Security
Establishes guidelines and general principle for initiating,
implementing, maintaining and improving information security
mgt.
Includes Best Practices of Control Objectives in 11 key areas of
information security management
ISO/IE 38500:2008 –International Standard for high-level
principle and guidance for senior executives and directors, and
advisors for effective and efficient use of IT
13. Three major sections
Scope, Application and Objectives
Framework for Good Corporate Governance of IT
Guidance for Corporate Governance of IT
10
10
KEY STANDARDS RELEVANT TO IG
RECORDS AND E-RECORDS MANAGEMENT
ISO 15489-1:2001 and ISO 15489-2:2001– International
Standard for Records Management
Part 1:Provides a framework and high-level overview of RM
core principles
Part 1:Defines RM as “Field of management responsibility for
the efficient and systematic control of creation receipt,
maintenance, use and disposition of records, including
processes for capturing and maintaining evidence of and
information about business activities and transactions in the
form of records”1
Part 2: Technical Specifications and Methodology for
implementing standard
ISO 30300;2011 – Information and Documentation-Management
Systems for Records-Fundamentals and Vocabulary
ISO 30301:2011 – Information and Documentation-Management
Systems for Records – Requirements
1ISO 15489-1:2001 Information and Documentation-Records
Management, Part 1:General Geneva: ISO, 2001), section 3.16.
11
14. 11
NATIONAL, INTERNATIONAL AND REGIONAL ERM
STANDARDS
United States E-Records Standard
U.S. DOD 5015.2 Design Criteria Standard For Electronic
Records Management Software Applications
Developed in 1997
Updated in 2002 and 2007
Canadian Standards
Electronic Records as Documentary Evidence CAN/CGSB-
72.34-2005
Microfilm and Electronic Images as Documentary Evidence
CAN/DGSB-72.11-93
Canadian Legal Considerations
Relies on prime directive-that an organization shall always be
prepared to produce its records as evidence- and its national
standards, for the admissibility of electronic records in court
proceedings
The admissibility of records as evidence is determined under the
business records provisions of the Evidence Act
12
12
NATIONAL, INTERNATIONAL AND REGIONAL ERM
STANDARDS…CONTINUED
United Kingdom
The National Archives
To sets of functions requirements to promote the development
15. of the electronic records management software market (one in
1999 and one in 2002)
Model Requirements of Electronic Records
MoReq2
MoReq2010
Australian ERM and Records Management Standards
Has consistently been world leader in this area
Adopted all three parts of ISO 16175 as its e-records standard
Australian Government Recordkeeping Metadata Standard
Version 2.0
Australian Government Locator Service
AS 5090:2003 – Work Process Analysis for Recordkeeping
13
13
LONG-TERM DIGITAL PRESERVATION
Referred to as “LTDP”
LTDP is a key area for IG policy development
Frequently not addressed in an IG plan
Should be applied in preserving historical and “vital records”
and in order to maintain its corporate or organizational memory
Key Standards for LTDP:
PDF/A-2 –official standard format for preserving electronic
documents, developed by Adobe.
ISO 19005-1:2005 Document Management is the published
specification requiring PDF format
ISO 14721:2012 – Space Data and Information Transfer
Systems –Open Archival Information Systems
ISO TR 18492(2005) – Long Term Preservation of Electronic
Document Based Information
ISO 16363:2012 – Space Data and Information Transfer
Systems-Audit and Certification of Trustworthy Digital
16. Repositories
14
14
BUSINESS CONTINUITY MANAGEMENT
ISO 22301:2012 – Societal Security – Business Continuity
Management Systems Requirements
Specifies requirements for creating and implementing a
standardized approach to business continuity management -----
this is also known as Disaster Recovery
Benefits of ISO 22301
Threat Identification and Assessment
Threat and Recovery Planning
Mission-critical process protection
Stakeholder Confidence
15
15
THINGS TO REMEMBER IN DEVELOPING THE IG POLICY
Take into account organizational goals
Draw clear lines of authority
Make sure you have an executive sponsor who can garner
executive support for the IG program and policies
IG program must contain communications and training
component
Stakeholders must be made aware of new policies and practices
17. Make sure you have metrics that are relevant and useful and can
actually be measured
Test and audit
Give feedback to employees based upon metrics, tests and audit
results
Establish and enforce clear penalties for policy violations and
communicate that to employees
Take into account organizational culture
16
16
The End
[email protected] Geanie Assante 2019
17
17
STAT200 Introduction to Statistics
Dataset for Written Assignments
Description of Dataset:
The data is a random sample from the US Department of
Labor’s 2016 Consumer Expenditure Surveys (CE) and provides
information about the composition of households and their
annual expenditures (https://www.bls.gov/cex/). It contains
information from 30 households, where a survey responder
provided the requested information; it is all self-reported
information. This dataset contains four socioeconomic variables
(whose names start with SE) and four expenditure variables
(whose names start with USD).
18. Description of Variables/Data Dictionary:
The following table is a data dictionary that describes the
variables and their locations in this dataset (Note: Dataset is on
second page of this document):
Variable Name
Location in Dataset
Variable Description
Coding
UniqueID#
First Column
Unique number used to identify each survey responder
Each responder has a unique number from 1-30
SE-MaritalStatus
Second Column
Marital Status of Head of Household
Not Married/Married
SE-Income
Third Column
Annual Household Income
Amount in US Dollars
SE-AgeHeadHousehold
Fourth Column
Age of the Head of Household
Age in Years
SE-FamilySize
Fifth Column
Total Number of People in Family (Both Adults and Children)
Number of People in Family
USD-Annual Expenditures
Sixth Column
Total Amount of Annual Expenditures
Amount in US Dollars
USD-Food
Seventh Column
Total Amount of Annual Expenditure on Food
19. Amount in US Dollars
USD-Entertainment
Eighth Column
Total Amount of Annual Expenditure on Entertainment
Amount in US Dollars
USD-Education
Ninth Column
Total Amount of Annual Expenditure on Education
Amount in US Dollars
How to read the data set: Each row contains information from
one household. For instance, the first row of the dataset starting
on the next page shows us that: the head of household is not
married and is 52 years old, has an annual household income of
$95,744, a family size of 4, annual expenditures of $55,963, and
spends $7,040 on food, $105 on entertainment, and $340 on
education.
UniqueID#
SE-MaritalStatus
SE-Income
SE-AgeHeadHousehold
SE-FamilySize
USD-AnnualExpenditures
USD-Food
USD-Entertainment
USD-Education
1
Not Married
95744
52
4
55963
27. 9321
133
478
30
Married
114932
44
5
81186
11077
80
111
STAT200: Written Assignment #1 - Descriptive Statistics Data
Analysis Plan - Instructions
Page 1 of 4
STAT200 Introduction to Statistics
Assignment #1: Descriptive Statistics Data Analysis Plan
Assignment #1: Prepare Descriptive Statistics Data Analysis
Plan
Before conducting any statistical analyses, researchers develop
a plan for how they will analyze their
data to answer their research questions. The purpose of this
assignment is to provide an experience
28. developing a descriptive statistics analysis plan. Note: This first
assignment is a plan only; no statistics
will be calculated or graphs created. The second assignment
will involve carrying out the plan, after
receiving feedback from your instructor.
Assignment Steps:
Step #1: Review the STAT200 data set file. (Note: This data set
will be used for all three of this term’s
written assignments).
The data is a subsample from the US Department of Labor’s
Consumer Expenditure Surveys (CE) and
provides information about the composition of households and
their annual expenditures
(https://www.bls.gov/cex/). Detailed information on the sample
and variables is included with the data
set file; please carefully review this information to familiarize
yourself with the data (Note: This
information will be used in Assignment #2 to describe the
dataset).
Step #2: Develop descriptive statistics data analysis plan.
➢ Task 1: Develop scenario. Imagine that you are head of a
29. household and have to determine a
household budget plan based on the data available from the
dataset. For instance, you are a 35
year old single parent with a high school diploma and one child.
➢ Task 2: Select variables for analysis that match the scenario
developed in Task 1.The data set
provides information on household consumption; there are
socioeconomic variables and
expenditures variables. The socioeconomic variable names start
with “SE-” and the expenditure
variable names start with a “USD;” all expenditures are in US
dollars. All students must use
income as one variable. Select two additional socioeconomic
variables (one qualitative and one
quantitative) and two expenditures for your analysis that match
the scenario you developed for
Task 1. For instance, using the example scenario of a 35 year
old single parent with a high
https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
30. https://www.bls.gov/cex/
https://www.bls.gov/cex/
https://www.bls.gov/cex/
STAT200: Written Assignment #1 - Descriptive Statistics Data
Analysis Plan - Instructions
Page 2 of 4
school diploma and one child, you could select “income,”
“education,” and “number of children”
as socioeconomic variables and then pick two household
expenditure items to show the
distribution of costs and compare that with your income. When
selecting variables, think about
the following three questions:
o Why am I choosing these variables?
o What interests me about these variables?
o What do I think will be the outcome?
➢ Task 3: Determine appropriate measures of central tendency
and dispersion for the selected
variables. For each quantitative variable, select at least one
measure of central tendency and at
least one measure of dispersion (Please see below table for list
of measures). For the qualitative
31. variable, select one measure of central tendency. When
determining the measures of central
tendency and dispersion, think about what is appropriate given
the level of measurement and
type of variable. Recommend referring to the text and
information posted in our LEO classroom
to help with this task (Note: you will use this information to
provide a rationale for your choice
of measures).
Measures of Central Tendency Measures of Dispersion
● Mean
● Mode
● Median
● Range
● Sample Standard Deviation
● Variance
➢ Task 4: Determine appropriate graph and/or table for each of
the selected variables. Select
one graph or table for each variable (Please see below table for
list of graphs and tables). When
determining the graphs and tables, think about what is
appropriate given the level of
measurement and type of variable. Recommend referring to the
32. text and information posted in
our LEO classroom to help with this task (Note: you will use
this information to provide a
rationale for your choice of graphs and/or tables).
Types of Graphs Types of Tables
● Pie Chart
● Bar Chart
● Histogram
● Box Plots (also known as Box-and-Whiskers Plot)
● Frequency Table
● Relative Frequency Table
● Grouped Frequency Table
STAT200: Written Assignment #1 - Descriptive Statistics Data
Analysis Plan - Instructions
Page 3 of 4
Step #3: Complete the “Assignment #1: Descriptive Statistics
Data Analysis Plan Template.”
Remember, you will not be conducting any statistical analysis,
drawing any graphs, or compiling any
tables for the first assignment. Rather, you need to wait for
feedback from your instructor on this
assignment and use that feedback to complete Assignment #2.
33. Here are the main sections for this assignment (i.e., completing
the plan template):
✓ Identifying Information. Fill in information on name, class,
instructor, and date.
✓ Scenario. In this section, briefly (2-3 sentences) describe the
scenario you developed in Step #2,
Task 1.
✓ Complete Table 1: Variables Selected for the Analysis. Enter
information the variables selected
for analysis in Step #2, Task 2. For each selected variable be
sure to include its: name as listed in
the data set, description, and variable type.
✓ Reason(s) for Selecting the Variables and Expected
Outcome(s): In this section, for each
selected variable, please answer the following questions:
✓ Why did I choose this variable?
✓ What interests me about this variable?
✓ What do I think will be the outcome?
✓ Complete Table 2. Numerical Summaries of the Selected
Variables. Enter information on
selected measures of central tendency and dispersion for each
selected variable. Be sure to
34. briefly explain why you choose those measurements. Note: The
information for the required
variable, “Income,” has already been completed and can be used
as a guide for completing
information on the remaining variables.
✓ Complete Table 3. Type of Graphs and/or Tables for Selected
Variables. Enter information on
selected graph and/or table for each selected variable. Be sure
to briefly explain why you
choose those measurements. Note: The information for the
required variable, “Income,” has
already been completed and can be used as a guide for
completing information on the
remaining variables.
Assignment Submission: Name the file that contains your
completed “Assignment #1: Descriptive
Statistics Data Analysis Plan Template” using the following
format: “Assignment1-StudentLastName.”
STAT200: Written Assignment #1 - Descriptive Statistics Data
Analysis Plan - Instructions
Page 4 of 4
35. Then, submit the file via the Assignments area in the LEO
classroom in the “Assignment #1: Descriptive
Statistics Data Analysis Plan” folder and wait for your
instructor’s feedback.
Grading Rubric for Written Assignment #1
Scenario and Selection of Related Variables
● Clear description of scenario
● Selected variables and reasons are appropriate for the
scenario.
20%
Selection of Measures of Central Tendency and Dispersion
For each variable:
● Appropriate measures selected.
● Rationale is provided and appropriate.
30%
Selection of Graphs and/or Tables
For each variable:
● Appropriate measures selected.
● Rationale is provided and appropriate.