SlideShare a Scribd company logo
1 of 31
Download to read offline
Final Presentation
-- Yueyao Wang
CONTENT
●Part I: Self-introduction slides
headshot and one paragraph of self-introduction
Github Repo Link
Kaggle Notebook Link(s)
LinkedIn URL
●Part II: Summary of what I have learned in this course and my takeaway for personal and professional growth
●Part III: My own market research report
Session 1. a new dataset that is not one of the instruction datasets we used in this course.
Session 2. Reproduce: Capstone Project Milestone 2: Research Design and The Data
Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing
●Part VI: Appendix
Capstone Project Milestone 2: Research Design and The Data
Capstone Project Milestone 3: Hypothesis Testing
Capstone Project Milestone 4: Regression
Capstone Project Milestone 5: Clustering
Self-introduction
• I come from Nanjing, China. After finishing my bachelor degree of Business English,
I continue to study Integrated Marketing in New York University. In 2019, I had an
internship in IKEA Nanjing for about 2 months. Although it was not a long time, I
learned some basic knowledge relating to marketing which was very important and
helpful to me. Also, the leader taught me the significance of pricing based on market
research.
• In the university, when I was a junior student, I participated in the Business English
Contest. In this contest, every team needed to plan a crossover joint of two different
brands. During this process, I found my interest and creation of brand marketing, so
it gave me a small direction in the future. My hobby in the daily life is dance—jazz
which is my favorite hobby.
• Tel: 8613057556779 Email: yw5244@nyu.edu
Yueyao Wang
Self-introduction
• Github Report URL: https://colab.research.google.com/github/wyyyyy-
627/NYU_Integrated_Marketing
• Kaggle Notebook Link: https://www.kaggle.com/yueyaowang/customer-segementation-
yw5244
• LinkedIn URL: http://www.linkedin.com/in/Winnie-Wang627
What I have learned in this course
• In this course, I have learned a lot about statistics. Because in my college life I majored in
arts, I did not have many knowledge related to Math. At first, I feel a little bit afraid of this
course. However, recently, I am confident that I have the ability to handle it!
• When I do my own Hypothesis Testing for the final presentation, I figure out the problem of
uploading data independently. I think this process is very meaningful to me and I also gain a
sense of achievement.
My Progress Chart
The source of my data
●Name of Data : Students Performance
●Link:
https://www.kaggle.com/spscientist/students-performance-in-
exams?select=StudentsPerformance.csv
●Summary of Data:evaluate the writing/reading/math
score from 5 angles-- gender, race, parental level of
education, lunch, test preparation; 1000 samples
Session1. Reproduce: Capstone Project Milestone 2: Research Design and
The Data
• https://datastudio.google.com/u/0/reporting/57c99570-7075-4c82-bc5d-095f8791adb9/page/vsQrB/edit
Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing
From this chart, we can
see the whole data.
Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing
One-Sample T-test
Reason:
In order to observe the mean of
Math score, so we use one-
sample T-test.
Null hypothesis:
The mean of Math score equals
to 60.
Conclusion:
Since the p-value is smaller
than 0.05, reject the null
hypothesis test. The mean of
Math score is larger than 66.
Two-Sample T-Test
Reason:
In order to observe the relationship between
two elements, so we use two-sample T-test.
Null hypothesis:
Lunch will have impact on students Math
score.
Conclusion:
Since the p-value is smaller than 0.05, reject
the null hypothesis test. Therefore, there is
no relationship between the free/ reduced
lunch and those standard.
Power analysis: T-test
Conclusion: For a 0.05 cohen d effect size, a power of 0.80, and a
type I error of 0.05, we need a sample size of 6280 (for each group).
summary
Limitations:
We can test the more detailed category of lunch. Also, we can have larger data sample.
Conclusions:
The parental level of education may have a huge effect on students performance. However, the lunch
standard has no relationship with Math score.
Appendix
Capstone Project Milestone 2: Research Design and The Data
Capstone Project Milestone 3: Hypothesis Testing
data of a Portuguese banking institution
URL: https://data.world/data-society/bank-marketing-data
the GDP data of countries from G20 countries
URL:https://stats.oecd.org/index.aspx?queryid=33940#
Github Report URL:https://github.com/wyyyyy-
627/NYU_Integrated_Marketing
1
2
3
Data Sources
Paried T-test
Reason: The chart can verify and compare the fluctuation of GDP volume before and after COVID-19,
which is from 2018 to 2020.
Null hypothesis: The countries’ GDP are almost the same in 2018 Q2 and 2020 Q2.
Conclusion: Since p-value is smaller than 0.05, reject the null hypothesis test. Most countries' GDP in
2018 is higher than GDP in 2020, and COVID-19 haves a negative impact on the country’s GDP.
Assumption
Reason:
I find that the linear relationship between
variables and the direction of the correlation.
Therefore, I can choose Pearson correlation
to test.
Null hypothesis:
GDP of the same country in 2018 Q2 and
2020 Q2 are not correlated.
Conclusion:
Since the p-value is smaller than 0.05, reject
the null hypothesis test. The null hypothesis
that GDP for same country in 2018 and 2020
are not correlated.
Two-Sample T-Test
Reason:
In order to observe the
relationship between two
elements, so we use two-sample
T-test.
Null hypothesis:
Balance of people with a loan is
same as those without.
Conclusion:
Since the p-value is smaller than
0.05, reject the null hypothesis
test. Therefore, there is no
relationship between the balance
of people with a loan and those
without.
Power analysis: T-test
Conclusion: For a 0.2 cohen d effect size, a power of 0.80, and a type I error of 0.05, we need a
sample size of 393 (for each group).
summary
Limitations:
We can test the effect of COVID-19 on the GDP of G20 countries more cpmprehensively, such as the data of 2021 in
the future. Also, we can have larger data sample.
Conclusions:
COVID-19 has a significant impact on the economies and GDP of G20 countries.
Following the outbreak of COVID-19, countries are correlating. People have loan balances are different from those that
don't have.
Capstone Project Milestone 4: Regression
►Summary of the data sources:
Based on the 4250 samples, I will analyze whether a customer will change telecommunications
provider, something known as “churning”.
►The regression model you choose and the result:
The regression model I choose is OLS. The result shows that length of time will influence the
charge, then the customers reconsider their choice.
►Github Report URL: https://colab.research.google.com/github/wyyyyy-
627/NYU_Integrated_Marketing/blob/main
►URL of data sources: https://www.kaggle.com/c/customer-churn-prediction-2020
Scatterplots
In these two charts, we will find that total
day calls and total night charge have no
linear relationship. However, total day
minutes and total day charge have linear
relationship.
Regression Result
When X1- total day calls, X2- total day minutes, the P-value is 0.86 and 0.513. Both P-value are larger
than 0.05, so we do not reject the hypothesis. There is no linear relationships.
Insights Gained from the Regression
From the only one linear relationship-- total day minutes and total day charge, I know that the
length of time will greatly influence the money. Therefore, the company can lower the money they
charge per minute. Also, they can publish more variety of phone plans, which can decrease the
total day charge and attract more customers.
Although total day calls and total day minutes do not have linear relationship with total night
charge, we should still pay attention to total night charge. For example, the night charge can be
lower than the day.
Assumptions Check and Further Research
Firstly, the scatter plot shows that there is
no correlation.(Check Assumption 2,4)
Secondly, a histogram of the residuals
i n d i c a t e s t h a t i t i s n o r m a l l y
distributed.(Check Assumption 1 and 3)
Thirdly, the P-value is 0.961 which is
larger than 0.05, so the independent
v a r i a b l e s a r e c o r r e l a t e d . ( C h e c k
Assumption6) Therefore, all the
assumptions satisfy the results.
As the further research, we need to find
more relationship which may cause the
sales decline. For example, we can
analyze the relationship between night
calls and night charge.
Capstone Project Milestone 5: Clustering
• Data sourcd URL: https://www.kaggle.com/hellbuoy/online-retail-customer-clustering
• The URL to my Kaggle.com link: https://www.kaggle.com/yueyaowang/customer-
segementation-yw5244
• summary of the data sources: Online retail is a transnational data set which contains all
the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and
registered non-store online retail. The company mainly sells unique all-occasion gifts.
Many customers of the company are wholesalers.
• the statistic methods you choose and the result: The country I choose is Germany. K-
mean cluster analysis and hierarchical clustering are two kind of method to research the
case and get the result. Hierarchical clustering: 2 target customer return.
K-Means Clustering
When metric=”calinski_harabasz”:
Computes the ratio of dispersion between and
within clusters, and base on the research the K=4.
Interpreting the Clustering
Based on the RFM rule, we should choose the customer
clusters with a lower recency, a higher frequency and
amount. From the K-means clustering results, we can find
that see that customers with Cluster_Id=1 best fits the
criteria.
Hierarchical Clustering
Hierachical clustering visualize tree by linkage
methods. In complete linkage hierachical clustering,
the distance between two clusters is defined as the
longest distance between two points in each cluster.
Hierarchical Clustering Analysis
Based on the RFM criteria, we should choose the customer
clusters with a lower recency, a higher frequency and amount.
From the K-means clustering results, we can find that see
that customers with Cluster_Id=1 best fits the criteria.
That's all!
Thank you!

More Related Content

Similar to Final Presentation Slide--yw5244

statistical measurement project presentation
statistical measurement project presentationstatistical measurement project presentation
statistical measurement project presentationKexinZhang22
 
wt2084 final presentation slides
wt2084 final presentation slideswt2084 final presentation slides
wt2084 final presentation slidesWeixiTan
 
Final Presentation
Final PresentationFinal Presentation
Final Presentationssuseraf9eb5
 
Final presentation zg2088
Final presentation zg2088Final presentation zg2088
Final presentation zg2088ssuserd6504f
 
Yx2489 final presentation slides
Yx2489 final presentation slidesYx2489 final presentation slides
Yx2489 final presentation slidesYiXu86
 
statistical measurement project present
statistical measurement project presentstatistical measurement project present
statistical measurement project presentKexinZhang22
 
statistical measurement project present
statistical measurement project presentstatistical measurement project present
statistical measurement project presentKexinZhang22
 
statistical measurement project presentation
statistical measurement project presentationstatistical measurement project presentation
statistical measurement project presentationKexinZhang22
 
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate Report on Opening Dutch Bangla Bank Fast Track in Collage Gate
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate Ashikur Rahman
 
Experimentation at Scale
Experimentation at ScaleExperimentation at Scale
Experimentation at ScaleAndy Edmonds
 
Instructions - Read FirstInstructions The following worksheets .docx
Instructions - Read FirstInstructions The following worksheets .docxInstructions - Read FirstInstructions The following worksheets .docx
Instructions - Read FirstInstructions The following worksheets .docxdirkrplav
 
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bank
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test BankDiscovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bank
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bankguzofahug
 
Intro to Data Analytics with Oscar's Director of Product
 Intro to Data Analytics with Oscar's Director of Product Intro to Data Analytics with Oscar's Director of Product
Intro to Data Analytics with Oscar's Director of ProductProduct School
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatisticsAaron Sankey
 
Anatomy of a Data Product and Lending Club Data
Anatomy of a Data Product and Lending Club DataAnatomy of a Data Product and Lending Club Data
Anatomy of a Data Product and Lending Club DataSri Ambati
 

Similar to Final Presentation Slide--yw5244 (20)

Yg2298
Yg2298Yg2298
Yg2298
 
statistical measurement project presentation
statistical measurement project presentationstatistical measurement project presentation
statistical measurement project presentation
 
wt2084 final presentation slides
wt2084 final presentation slideswt2084 final presentation slides
wt2084 final presentation slides
 
Final Presentation
Final PresentationFinal Presentation
Final Presentation
 
Final presentation zg2088
Final presentation zg2088Final presentation zg2088
Final presentation zg2088
 
Yx2489 final presentation slides
Yx2489 final presentation slidesYx2489 final presentation slides
Yx2489 final presentation slides
 
statistical measurement project present
statistical measurement project presentstatistical measurement project present
statistical measurement project present
 
statistical measurement project present
statistical measurement project presentstatistical measurement project present
statistical measurement project present
 
statistical measurement project presentation
statistical measurement project presentationstatistical measurement project presentation
statistical measurement project presentation
 
De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
 
Cost Estimation
Cost EstimationCost Estimation
Cost Estimation
 
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate Report on Opening Dutch Bangla Bank Fast Track in Collage Gate
Report on Opening Dutch Bangla Bank Fast Track in Collage Gate
 
Experimentation at Scale
Experimentation at ScaleExperimentation at Scale
Experimentation at Scale
 
SPC Presentation Master
SPC Presentation MasterSPC Presentation Master
SPC Presentation Master
 
Instructions - Read FirstInstructions The following worksheets .docx
Instructions - Read FirstInstructions The following worksheets .docxInstructions - Read FirstInstructions The following worksheets .docx
Instructions - Read FirstInstructions The following worksheets .docx
 
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bank
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test BankDiscovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bank
Discovering Statistics Using IBM SPSS Statistics 4th Edition Field Test Bank
 
Intro to Data Analytics with Oscar's Director of Product
 Intro to Data Analytics with Oscar's Director of Product Intro to Data Analytics with Oscar's Director of Product
Intro to Data Analytics with Oscar's Director of Product
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
 
Building a Data Driven Business
Building a Data Driven BusinessBuilding a Data Driven Business
Building a Data Driven Business
 
Anatomy of a Data Product and Lending Club Data
Anatomy of a Data Product and Lending Club DataAnatomy of a Data Product and Lending Club Data
Anatomy of a Data Product and Lending Club Data
 

Recently uploaded

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationNeilDeclaro1
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptNishitharanjan Rout
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfPondicherry University
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Philosophy of china and it's charactistics
Philosophy of china and it's charactisticsPhilosophy of china and it's charactistics
Philosophy of china and it's charactisticshameyhk98
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 

Recently uploaded (20)

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health Education
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Philosophy of china and it's charactistics
Philosophy of china and it's charactisticsPhilosophy of china and it's charactistics
Philosophy of china and it's charactistics
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 

Final Presentation Slide--yw5244

  • 2. CONTENT ●Part I: Self-introduction slides headshot and one paragraph of self-introduction Github Repo Link Kaggle Notebook Link(s) LinkedIn URL ●Part II: Summary of what I have learned in this course and my takeaway for personal and professional growth ●Part III: My own market research report Session 1. a new dataset that is not one of the instruction datasets we used in this course. Session 2. Reproduce: Capstone Project Milestone 2: Research Design and The Data Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing ●Part VI: Appendix Capstone Project Milestone 2: Research Design and The Data Capstone Project Milestone 3: Hypothesis Testing Capstone Project Milestone 4: Regression Capstone Project Milestone 5: Clustering
  • 3. Self-introduction • I come from Nanjing, China. After finishing my bachelor degree of Business English, I continue to study Integrated Marketing in New York University. In 2019, I had an internship in IKEA Nanjing for about 2 months. Although it was not a long time, I learned some basic knowledge relating to marketing which was very important and helpful to me. Also, the leader taught me the significance of pricing based on market research. • In the university, when I was a junior student, I participated in the Business English Contest. In this contest, every team needed to plan a crossover joint of two different brands. During this process, I found my interest and creation of brand marketing, so it gave me a small direction in the future. My hobby in the daily life is dance—jazz which is my favorite hobby. • Tel: 8613057556779 Email: yw5244@nyu.edu Yueyao Wang
  • 4. Self-introduction • Github Report URL: https://colab.research.google.com/github/wyyyyy- 627/NYU_Integrated_Marketing • Kaggle Notebook Link: https://www.kaggle.com/yueyaowang/customer-segementation- yw5244 • LinkedIn URL: http://www.linkedin.com/in/Winnie-Wang627
  • 5. What I have learned in this course • In this course, I have learned a lot about statistics. Because in my college life I majored in arts, I did not have many knowledge related to Math. At first, I feel a little bit afraid of this course. However, recently, I am confident that I have the ability to handle it! • When I do my own Hypothesis Testing for the final presentation, I figure out the problem of uploading data independently. I think this process is very meaningful to me and I also gain a sense of achievement.
  • 7. The source of my data ●Name of Data : Students Performance ●Link: https://www.kaggle.com/spscientist/students-performance-in- exams?select=StudentsPerformance.csv ●Summary of Data:evaluate the writing/reading/math score from 5 angles-- gender, race, parental level of education, lunch, test preparation; 1000 samples
  • 8. Session1. Reproduce: Capstone Project Milestone 2: Research Design and The Data • https://datastudio.google.com/u/0/reporting/57c99570-7075-4c82-bc5d-095f8791adb9/page/vsQrB/edit
  • 9. Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing From this chart, we can see the whole data.
  • 10. Session 3. Reproduce: Capstone Project Milestone 3: Hypothesis Testing One-Sample T-test Reason: In order to observe the mean of Math score, so we use one- sample T-test. Null hypothesis: The mean of Math score equals to 60. Conclusion: Since the p-value is smaller than 0.05, reject the null hypothesis test. The mean of Math score is larger than 66.
  • 11. Two-Sample T-Test Reason: In order to observe the relationship between two elements, so we use two-sample T-test. Null hypothesis: Lunch will have impact on students Math score. Conclusion: Since the p-value is smaller than 0.05, reject the null hypothesis test. Therefore, there is no relationship between the free/ reduced lunch and those standard.
  • 12. Power analysis: T-test Conclusion: For a 0.05 cohen d effect size, a power of 0.80, and a type I error of 0.05, we need a sample size of 6280 (for each group).
  • 13. summary Limitations: We can test the more detailed category of lunch. Also, we can have larger data sample. Conclusions: The parental level of education may have a huge effect on students performance. However, the lunch standard has no relationship with Math score.
  • 14. Appendix Capstone Project Milestone 2: Research Design and The Data
  • 15. Capstone Project Milestone 3: Hypothesis Testing data of a Portuguese banking institution URL: https://data.world/data-society/bank-marketing-data the GDP data of countries from G20 countries URL:https://stats.oecd.org/index.aspx?queryid=33940# Github Report URL:https://github.com/wyyyyy- 627/NYU_Integrated_Marketing 1 2 3 Data Sources
  • 16. Paried T-test Reason: The chart can verify and compare the fluctuation of GDP volume before and after COVID-19, which is from 2018 to 2020. Null hypothesis: The countries’ GDP are almost the same in 2018 Q2 and 2020 Q2. Conclusion: Since p-value is smaller than 0.05, reject the null hypothesis test. Most countries' GDP in 2018 is higher than GDP in 2020, and COVID-19 haves a negative impact on the country’s GDP.
  • 17. Assumption Reason: I find that the linear relationship between variables and the direction of the correlation. Therefore, I can choose Pearson correlation to test. Null hypothesis: GDP of the same country in 2018 Q2 and 2020 Q2 are not correlated. Conclusion: Since the p-value is smaller than 0.05, reject the null hypothesis test. The null hypothesis that GDP for same country in 2018 and 2020 are not correlated.
  • 18. Two-Sample T-Test Reason: In order to observe the relationship between two elements, so we use two-sample T-test. Null hypothesis: Balance of people with a loan is same as those without. Conclusion: Since the p-value is smaller than 0.05, reject the null hypothesis test. Therefore, there is no relationship between the balance of people with a loan and those without.
  • 19. Power analysis: T-test Conclusion: For a 0.2 cohen d effect size, a power of 0.80, and a type I error of 0.05, we need a sample size of 393 (for each group).
  • 20. summary Limitations: We can test the effect of COVID-19 on the GDP of G20 countries more cpmprehensively, such as the data of 2021 in the future. Also, we can have larger data sample. Conclusions: COVID-19 has a significant impact on the economies and GDP of G20 countries. Following the outbreak of COVID-19, countries are correlating. People have loan balances are different from those that don't have.
  • 21. Capstone Project Milestone 4: Regression ►Summary of the data sources: Based on the 4250 samples, I will analyze whether a customer will change telecommunications provider, something known as “churning”. ►The regression model you choose and the result: The regression model I choose is OLS. The result shows that length of time will influence the charge, then the customers reconsider their choice. ►Github Report URL: https://colab.research.google.com/github/wyyyyy- 627/NYU_Integrated_Marketing/blob/main ►URL of data sources: https://www.kaggle.com/c/customer-churn-prediction-2020
  • 22. Scatterplots In these two charts, we will find that total day calls and total night charge have no linear relationship. However, total day minutes and total day charge have linear relationship.
  • 23. Regression Result When X1- total day calls, X2- total day minutes, the P-value is 0.86 and 0.513. Both P-value are larger than 0.05, so we do not reject the hypothesis. There is no linear relationships.
  • 24. Insights Gained from the Regression From the only one linear relationship-- total day minutes and total day charge, I know that the length of time will greatly influence the money. Therefore, the company can lower the money they charge per minute. Also, they can publish more variety of phone plans, which can decrease the total day charge and attract more customers. Although total day calls and total day minutes do not have linear relationship with total night charge, we should still pay attention to total night charge. For example, the night charge can be lower than the day.
  • 25. Assumptions Check and Further Research Firstly, the scatter plot shows that there is no correlation.(Check Assumption 2,4) Secondly, a histogram of the residuals i n d i c a t e s t h a t i t i s n o r m a l l y distributed.(Check Assumption 1 and 3) Thirdly, the P-value is 0.961 which is larger than 0.05, so the independent v a r i a b l e s a r e c o r r e l a t e d . ( C h e c k Assumption6) Therefore, all the assumptions satisfy the results. As the further research, we need to find more relationship which may cause the sales decline. For example, we can analyze the relationship between night calls and night charge.
  • 26. Capstone Project Milestone 5: Clustering • Data sourcd URL: https://www.kaggle.com/hellbuoy/online-retail-customer-clustering • The URL to my Kaggle.com link: https://www.kaggle.com/yueyaowang/customer- segementation-yw5244 • summary of the data sources: Online retail is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. The company mainly sells unique all-occasion gifts. Many customers of the company are wholesalers. • the statistic methods you choose and the result: The country I choose is Germany. K- mean cluster analysis and hierarchical clustering are two kind of method to research the case and get the result. Hierarchical clustering: 2 target customer return.
  • 27. K-Means Clustering When metric=”calinski_harabasz”: Computes the ratio of dispersion between and within clusters, and base on the research the K=4.
  • 28. Interpreting the Clustering Based on the RFM rule, we should choose the customer clusters with a lower recency, a higher frequency and amount. From the K-means clustering results, we can find that see that customers with Cluster_Id=1 best fits the criteria.
  • 29. Hierarchical Clustering Hierachical clustering visualize tree by linkage methods. In complete linkage hierachical clustering, the distance between two clusters is defined as the longest distance between two points in each cluster.
  • 30. Hierarchical Clustering Analysis Based on the RFM criteria, we should choose the customer clusters with a lower recency, a higher frequency and amount. From the K-means clustering results, we can find that see that customers with Cluster_Id=1 best fits the criteria.