Analyzing and Visualizing Data
Residency Session
10/4/2019 – 10/6/2019
Dr. Dexter Francis
UC Residency Weekend Schedule
Friday:
Check In: 4PM
Class: 5PM – 10PM *No Meal Breaks
Saturday:
Check In: 7AM
Class: 8AM – 7:30PM *Lunch Break
Sunday:
Check In: 7AM
Class: 8AM – 1PM *No Meal Breaks
Agenda - Overall
Friday
Introductions
Project Review
Dataset Selection
Group Planning Session (I will spend 10-15mins with each group)
Group Work
Q&A Session
Saturday
Group Work
Q&A Session
Sunday
Group Presentations (15 - 20 mins)
Q&A Session
Wrap Up
Residency Assessments
Friday: Reflective Assessment – Data Scientist Resume
Saturday: Reflective Assessment – Characteristics of A Data Whisperer
Sunday: Reflective Assessment – Modern Data Scientist
Example –
Review the data scientist resume sample provided in this folder. Select and discuss 2 components of the data scientist resume that relates to your future or current job.
Your reflective assessment paper should be at 100 words, double-spaced, and typed in an easy-to-read MS Word (other processors are fine to use but save it in MS Word format).
Residency Assignment – Research: 5 Sections
Example -
Topic: The impact on student learning in small class sizes in the United States
Dataset: dataset_Star.csv
Resource File: R_The impact on student learning and small class sizes in the United States.pdf
File Location: “Lesson 6” folder in iLearn
**Must be APA formatted**
Title Page
Document body with citations
Reference Page
Residency Assignment – Research: 5 Sections
Section 1: Discuss Topic Background (Citations are required)
Section 2: Data Analysis (R Studio - R language - Library: mosaic)
Describe the fields of the dataset.
Create a summary of stats for the dataset.
Discuss the Min, Max, Median, and Mean of the continuous fields.
Discuss the Counts and Percentages of the categorical fields.
Discuss any missing data elements.
Residency Assignment – Research: 5 Sections
Section 3: Data Visualizations (R Studio - R language - Library: ggplot2)
Provided Graphs: Bar Plot, Box Plot, Scatter Plot, and Histogram
Discuss: Plot, Breakout, Drilldown, and Position (only Bar Plot)
Findings: What story is presented in the visualizations?
Residency Assignment – Research: 5 Sections
Section 4: Discuss Findings (Citations are required)
Compare (Similar)
Contrast (Difference)
Articles that Support Information
Conclusion of the findings
Section 5: Reference List
At least 8 References (4 must be scholarly peer-reviewed articles)
Residency Assignment – Presentation
Power Point Slides:
Summarize research
Data visualizations
Findings
Conclusion
Time: 15 - 20 minutes
Residency Assignment – Topics
Research Topics
A comparison of health insurance coverage and hours worked by wives in the United States
Preferences for car choice in the United States
The benefits of bio-feedback treatment for migraine headaches
The change in demand for medical car.
2. Friday
Introductions
Project Review
Dataset Selection
Group Planning Session (I will spend 10-15mins with each
group)
Group Work
Q&A Session
Saturday
Group Work
Q&A Session
Sunday
Group Presentations (15 - 20 mins)
Q&A Session
Wrap Up
Residency Assessments
Friday: Reflective Assessment – Data Scientist Resume
Saturday: Reflective Assessment – Characteristics of A Data
Whisperer
Sunday: Reflective Assessment – Modern Data Scientist
Example –
Review the data scientist resume sample provided in this folder.
Select and discuss 2 components of the data scientist resume
that relates to your future or current job.
3. Your reflective assessment paper should be at 100 words,
double-spaced, and typed in an easy-to-read MS Word (other
processors are fine to use but save it in MS Word format).
Residency Assignment – Research: 5 Sections
Example -
Topic: The impact on student learning in small class sizes in the
United States
Dataset: dataset_Star.csv
Resource File: R_The impact on student learning and small
class sizes in the United States.pdf
File Location: “Lesson 6” folder in iLearn
**Must be APA formatted**
Title Page
Document body with citations
Reference Page
Residency Assignment – Research: 5 Sections
Section 1: Discuss Topic Background (Citations are required)
Section 2: Data Analysis (R Studio - R language - Library:
mosaic)
4. Describe the fields of the dataset.
Create a summary of stats for the dataset.
Discuss the Min, Max, Median, and Mean of the continuous
fields.
Discuss the Counts and Percentages of the categorical fields.
Discuss any missing data elements.
Residency Assignment – Research: 5 Sections
Section 3: Data Visualizations (R Studio - R language - Library:
ggplot2)
Provided Graphs: Bar Plot, Box Plot, Scatter Plot, and
Histogram
Discuss: Plot, Breakout, Drilldown, and Position (only Bar Plot)
Findings: What story is presented in the visualizations?
Residency Assignment – Research: 5 Sections
Section 4: Discuss Findings (Citations are required)
Compare (Similar)
Contrast (Difference)
Articles that Support Information
Conclusion of the findings
Section 5: Reference List
At least 8 References (4 must be scholarly peer-reviewed
5. articles)
Residency Assignment – Presentation
Power Point Slides:
Summarize research
Data visualizations
Findings
Conclusion
Time: 15 - 20 minutes
Residency Assignment – Topics
Research Topics
A comparison of health insurance coverage and hours worked by
wives in the United States
Preferences for car choice in the United States
The benefits of bio-feedback treatment for migraine headaches
The change in demand for medical care in the United States
The characteristics that give round cut diamonds value
The impact of living standards on medical expenses in Vietnam
The impact of unemployment insurance on blue collar workers
6. in the United States
The impact on student learning and small class sizes in the
United States
The results of health reform in German after 1988
Wages and Education of Young Males
Wife Working Hours
Visits to Physician Office
Cigarette Consumption
Choice of Brand for Crackers
Homework
Complete the following homework scenario:
Bob and Lisa are both married, working adults. They both plan
for retirement and consider the $2,000 annual contribution a
must.
7. First, consider Lisa’s savings. She began working at age 20 and
began making an annual contribution of $2,000 at the first of
the year beginning with her first year. She makes 13
contributions. She worked until she was 32 and then left full
time work to have children and be a stay at home mom. She left
her IRA invested and plans to begin drawing from her IRA
when she is 65.
Bob started his IRA at age 32. The first 12 years of his working
career, he used his discretionary income to buy a home, upgrade
the family cars, take vacations, and pursue his golfing hobby.
At age 32, he made his first $2,000 contribution to an IRA, and
contributed $2,000 every year up until age 65, a total of 33
years / contributions. He plans to retire at age 65 and make
withdrawals from his IRA.
Both IRA accounts grow at a 7% annual rate. Do not consider
any tax effect.
Write a two to three (2-3) paragraph summary in which you:
Create a chart summarizing the details of the investment for
both Bob and Lisa.
Explain the results in terms of time value of money.
· Using the following links to the Frontline videos and then
answer the following questions:
. Short Version - https://youtu.be/HX6Fg62l0e8
. Long Version - https://youtu.be/7MXOr3KELqE
1. What was the impact of the near failure of Bear Stearns and
the failure of Lehman Brothers on Money Markets?
2. What actions did the Federal Reserve and the Treasury
Department take? What were the impacts of the decisions if
any?
8. Homework
Complete the following homework scenario:
Select one (1) U.S. publicly traded company and review its most
recent Annual Report. (You may use one (1) of the three (3)
companies you selected for your Stock Journal assignment.)
Use the Income Statement and Balance Sheet to determine the
changes in:
assets, liabilities, and equity total revenue and net income
Briefly describe the change from the current and prior years in
each of these key areas and determine if the changes would be
positive or negative from an investor / stockholder’s view.
Determine the current value of your total investment. Do not
make any changes to your investment at this time. Calculate
your total based on the number of shares and the new price per
share, for each company.
Provide your opinion / assessment of your investments.
Evaluate the results of your current investment. Are you happy
with the result and the trend? Are you upset because you’re
investment is worth less than $25,000. Feel free to speculate /
guess at why you believe the stock increased, decreased, or
remained static.
9. Complete the following homework scenario:
Required:
Compare the results of the three (3) methods by quality of
information for decision making. Using what you have learned
about the three (3) methods, identify the best project by the
criteria of long term increase in value. (You do not need to do
further research.) Convey your understanding of the Time Value
of Money principles used or not used in the three (3) methods.
Review the video titled “NPV, IRR, MIRR for Mac and PC
Excel” (located at
https://www.youtube.com/watch?v=C7CryVgFbBc and
previously listed in Week 4) to help you understand the
foundational concepts:
Scenario Information:
Assume that two gas stations are for sale with the following
10. cash flows; CF1 is the Cash Flow in the first year, and CF2 is
the Cash Flow in the second year. This is the time line and data
used in calculating the Payback Period, Net Present Value, and
Internal Rate of Return. The calculations are done for you. Your
task is to select the best project and explain your decision. The
methods are presented and the decision each indicates is given
below.
Investment Sales Price CF1 CF2
Gas Station A $50,000 $0 $100,000
Gas Station B $50,000 $50,000 $25,000
Three (3) Capital Budgeting Methods are presented:
Payback Period: Gas Station A is paid back in 2 years; CF1 in
year 1, and CF2 in year 2. Gas Station B is paid back in one (1)
year. According to the payback period, when given the choice
between two mutually exclusive projects, the investment paid
back in the shortest time is selected.
Net Present Value: Consider the gas station example above
under the NPV method, and a discount rate of 10%:
NPVgas station A = $100,000/(1+.10)2 - $50,000 = $32,644
NPVgas station B = $50,000/(1+.10) + $25,000/(1+.10)2 -
$50,000 = $16,115
Internal Rate of Return: Assuming 10% is the cost of funds; the
IRR for Station A is 41.421%.; for Station B, 36.602.
Summary of the Three (3) Methods:
Gas Station B should be selected, as the investment is returned
in 1 period rather than 2 periods required for Gas Station A.
Under the NPV criteria, however, the decision favors gas station
A, as it has the higher net present value. NPV is a measure of
the value of the investment.
The IRR method favors Gas Station A. as it has a higher return,
exceeding the cost of funds (10%) by the highest return.
11. Homework
Complete the following homework scenario:
Bob and Lisa are both married, working adults. They both plan
for retirement and consider the $2,000 annual contribution a
must.
First, consider Lisa’s savings. She began working at age 20 and
began making an annual contribution of $2,000 at the first of
the year beginning with her first year. She makes 13
contributions. She worked until she was 32 and then left full
time work to have children and be a stay at home mom. She left
her IRA invested and plans to begin drawing from her IRA
when she is 65.
Bob started his IRA at age 32. The first 12 years of his working
career, he used his discretionary income to buy a home, upgrade
the family cars, take vacations, and pursue his golfing hobby.
At age 32, he made his first $2,000 contribution to an IRA, and
contributed $2,000 every year up until age 65, a total of 33
years / contributions. He plans to retire at age 65 and make
withdrawals from his IRA.
Both IRA accounts grow at a 7% annual rate. Do not consider
any tax effect.
Write a two to three (2-3) paragraph summary in which you:
Create a chart summarizing the details of the investment for
both Bob and Lisa.
Explain the results in terms of time value of money.
· Using the following links to the Frontline videos and then
answer the following questions:
. Short Version - https://youtu.be/HX6Fg62l0e8
12. . Long Version - https://youtu.be/7MXOr3KELqE
1. What was the impact of the near failure of Bear Stearns and
the failure of Lehman Brothers on Money Markets?
2. What actions did the Federal Reserve and the Treasury
Department take? What were the impacts of the decisions if
any?
Homework
Complete the following homework scenario:
Select one (1) U.S. publicly traded company and review its most
recent Annual Report. (You may use one (1) of the three (3)
companies you selected for your Stock Journal assignment.)
Use the Income Statement and Balance Sheet to determine the
changes in:
assets, liabilities, and equity total revenue and net income
Briefly describe the change from the current and prior years in
each of these key areas and determine if the changes would be
positive or negative from an investor / stockholder’s view.
Determine the current value of your total investment. Do not
make any changes to your investment at this time. Calculate
your total based on the number of shares and the new price per
share, for each company.
Provide your opinion / assessment of your investments.
Evaluate the results of your current investment. Are you happy
13. with the result and the trend? Are you upset because you’re
investment is worth less than $25,000. Feel free to speculate /
guess at why you believe the stock increased, decreased, or
remained static.
Complete the following homework scenario:
Required:
Compare the results of the three (3) methods by quality of
information for decision making. Using what you have learned
about the three (3) methods, identify the best project by the
criteria of long term increase in value. (You do not need to do
further research.) Convey your understanding of the Time Value
of Money principles used or not used in the three (3) methods.
14. Review the video titled “NPV, IRR, MIRR for Mac and PC
Excel” (located at
https://www.youtube.com/watch?v=C7CryVgFbBc and
previously listed in Week 4) to help you understand the
foundational concepts:
Scenario Information:
Assume that two gas stations are for sale with the following
cash flows; CF1 is the Cash Flow in the first year, and CF2 is
the Cash Flow in the second year. This is the time line and data
used in calculating the Payback Period, Net Present Value, and
Internal Rate of Return. The calculations are done for you. Your
task is to select the best project and explain your decision. The
methods are presented and the decision each indicates is given
below.
Investment Sales Price CF1 CF2
Gas Station A $50,000 $0 $100,000
Gas Station B $50,000 $50,000 $25,000
Three (3) Capital Budgeting Methods are presented:
Payback Period: Gas Station A is paid back in 2 years; CF1 in
year 1, and CF2 in year 2. Gas Station B is paid back in one (1)
year. According to the payback period, when given the choice
between two mutually exclusive projects, the investment paid
back in the shortest time is selected.
Net Present Value: Consider the gas station example above
under the NPV method, and a discount rate of 10%:
NPVgas station A = $100,000/(1+.10)2 - $50,000 = $32,644
NPVgas station B = $50,000/(1+.10) + $25,000/(1+.10)2 -
$50,000 = $16,115
Internal Rate of Return: Assuming 10% is the cost of funds; the
IRR for Station A is 41.421%.; for Station B, 36.602.
Summary of the Three (3) Methods:
Gas Station B should be selected, as the investment is returned
15. in 1 period rather than 2 periods required for Gas Station A.
Under the NPV criteria, however, the decision favors gas station
A, as it has the higher net present value. NPV is a measure of
the value of the investment.
The IRR method favors Gas Station A. as it has a higher return,
exceeding the cost of funds (10%) by the highest return.