SlideShare a Scribd company logo
1 of 18
ChoosingThe Best
Neighbourhood for
Nigerian Barbeque
Business inToronto
The Problem
• Yahuza Suya wants to open a new branch of their Nigerian-style
barbeque chicken and beef in Toronto.
• They want to eliminate the likelihood of failure as much as possible.
• They are clear about their factors for success
• 1. Safety and Security
• 2. Income within pricing market
• 3. Close to transportation stations- due to immigrant community
presence
My proposed solution
• 1. Get the data on the various neighbourhoods
• 2. Compare them against others
• 3. Pick out the neighbourhoods that meet this criteria most
• 4. Pick out the neighbourhood that satisfies these conditions and is supported by research
Data
• 1. Crime Data
• 2. Income Data
• 3. Transportation station data
• 4.Toronto postal codes, neighbourhoods and coordinates data
• 5. Foursquare Data
Tools
• 1. Dataframes
• 2. Maps
• 3. Foursquare API
• 4. CSV Tables
Python Libraries,Functions and Techniques Used
• 1 NumPy for computation
• 2. Folium for building maps
• 3. Pandas for handling dataframes
• 4. BeautifulSoup for scraping websites for data
• 5. Requests for getting results
• 6. Geopy for handling location data
• 7. sklearn for k-means clustering
Sources of Data
• Sources
• General Toronto data was obtained from
• https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M
•
• The demographic data was obtained from https://en.wikipedia.org/wiki/Demographics_of_Toronto_neighbourhoods
•
• The crime data was obtained from http://data.torontopolice.on.ca/datasets?q=crime
•
• The list of Metro stations was obtained from https://en.wikipedia.org/wiki/List_of_Toronto_subway_stations
Methodology
• We first import all necessary libraries. We ignore libraries like folium and sklearn until we need
them so that it does not slow down the processing
• Neigbourhood Data- Scrape
• https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M.
• The resulting dataframe:
I clean up, drop non-assigned cells and rename cells to ensure consistency . The resulting dataframe is:
Income Data:
We scrape 'https://en.wikipedia.org/wiki/Demographics_of_Toronto_neighbourhoods’
After cleaning and sorting we have :
Crime Data
We import csv . Originally extracted from data.torontopolice.ca, it was cleaned and sorted then uploaded
On a website . Resulting Dataframe:
Visualization of Toronto and Toronto Metro Stations
USING FOURSQUARE, WE OBTAIN VENUE RESULTS AND PERFORM K-MEANS
Cluster visualisation below where k=5
A snapshot of cluster 3 which is our where our preferred neighbourhood is likely to come from:
BY narrowing down on the neighbourhoods which satisfy our conditions and also fit the likely clusters, we came to:
Lawrence Park
Rosedale
Parkwoods
Willowdale West
Riverdale West
Mimico
On further observation,, we narrow down to three and have more specific results, I look closely and see that:
1.Lawrence Parksatisfies all the three requirements and is close to a college.
This will attract young adults who have an active nightlife.
2. Willowdale Westis also a good idea as it satisfies the requirements.
3. The third option will be Parkwoods, which also satisfies the requirement of safety and income.
Further search reveals there are many immigrants.
CONCLUSIONS
Choosing a good neighbourhood usually depends on more than one factor. In most cases, it is usually a healthy combination of
important factors.
For this task, while safety and pricing were major factors, it would have been counter productive to choose based on these two factors
alone. The use of foursquare data and the venues provided gave us a more descriptive view of these neighbourhoods and we are able
to make a more robust decision.
Such tasks show the important of Data Science as it enables fact-based decision-making, even if it means coming to the same
assumption.
Without Data, You’re just another person with an
opinion
• - w. Edwards deming

More Related Content

Similar to Battle of the neighbourhoods toronto yahuza suya

Dark Side of the Net Lecture 4 TOR
Dark Side of the Net Lecture 4 TOR Dark Side of the Net Lecture 4 TOR
Dark Side of the Net Lecture 4 TOR Marcus Leaning
 
Intranets 2014, Mobility for Queensland Police
Intranets 2014, Mobility for Queensland PoliceIntranets 2014, Mobility for Queensland Police
Intranets 2014, Mobility for Queensland PoliceCorne Grotius
 
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtpRajprakash Dwivedi
 
A Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsA Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsOscar Corcho
 
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...Suhas Dutta
 
Creating an Itinerary in Toronto, Canada
Creating an Itinerary in Toronto, CanadaCreating an Itinerary in Toronto, Canada
Creating an Itinerary in Toronto, CanadaVirgilio Espina
 
A Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura WynterA Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura Wynterwkwsci-research
 
IoT - The Building Block for IR 4.0
IoT - The Building Block for IR 4.0IoT - The Building Block for IR 4.0
IoT - The Building Block for IR 4.0Dr. Mazlan Abbas
 
Data & Society Taxi Privacy Talk
Data & Society Taxi Privacy TalkData & Society Taxi Privacy Talk
Data & Society Taxi Privacy Talkcwhong
 
Daniel Murray Final Report
Daniel Murray Final ReportDaniel Murray Final Report
Daniel Murray Final ReportDaniel Murray
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsSloan Carne
 
Eating the elephant
Eating the elephantEating the elephant
Eating the elephantRamece Cave
 
Data sciences and marketing analytics
Data sciences and marketing analyticsData sciences and marketing analytics
Data sciences and marketing analyticsMJ Xavier
 
Designing for Mobility - Spot The Dangerous Criminal
Designing for Mobility - Spot The Dangerous CriminalDesigning for Mobility - Spot The Dangerous Criminal
Designing for Mobility - Spot The Dangerous CriminalSMS Management and Technology
 
Capstone project the battle of neighborhoods presentation
Capstone project the battle of neighborhoods presentationCapstone project the battle of neighborhoods presentation
Capstone project the battle of neighborhoods presentationLakshmi Narayanan S
 
Interconnection in Regional Markets
Interconnection in Regional MarketsInterconnection in Regional Markets
Interconnection in Regional MarketsAPNIC
 
Revenue Growth through Machine Learning
Revenue Growth through Machine LearningRevenue Growth through Machine Learning
Revenue Growth through Machine LearningDataWorks Summit
 
Summit EU Machine Learning
Summit EU  Machine LearningSummit EU  Machine Learning
Summit EU Machine LearningTed Dunning
 
Connections Drive Digital Transformation
Connections Drive Digital TransformationConnections Drive Digital Transformation
Connections Drive Digital TransformationNeo4j
 

Similar to Battle of the neighbourhoods toronto yahuza suya (20)

Dark Side of the Net Lecture 4 TOR
Dark Side of the Net Lecture 4 TOR Dark Side of the Net Lecture 4 TOR
Dark Side of the Net Lecture 4 TOR
 
Intranets 2014, Mobility for Queensland Police
Intranets 2014, Mobility for Queensland PoliceIntranets 2014, Mobility for Queensland Police
Intranets 2014, Mobility for Queensland Police
 
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
 
A Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsA Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's Datasets
 
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...
Ecommerce Retail Trends in India (2013)- Data based insights from Tier 2 and ...
 
Creating an Itinerary in Toronto, Canada
Creating an Itinerary in Toronto, CanadaCreating an Itinerary in Toronto, Canada
Creating an Itinerary in Toronto, Canada
 
A Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura WynterA Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura Wynter
 
IoT - The Building Block for IR 4.0
IoT - The Building Block for IR 4.0IoT - The Building Block for IR 4.0
IoT - The Building Block for IR 4.0
 
Data & Society Taxi Privacy Talk
Data & Society Taxi Privacy TalkData & Society Taxi Privacy Talk
Data & Society Taxi Privacy Talk
 
Daniel Murray Final Report
Daniel Murray Final ReportDaniel Murray Final Report
Daniel Murray Final Report
 
Revealing True Human Mobility Pattern Using Smart Data
Revealing True Human Mobility Pattern Using Smart DataRevealing True Human Mobility Pattern Using Smart Data
Revealing True Human Mobility Pattern Using Smart Data
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU Investigators
 
Eating the elephant
Eating the elephantEating the elephant
Eating the elephant
 
Data sciences and marketing analytics
Data sciences and marketing analyticsData sciences and marketing analytics
Data sciences and marketing analytics
 
Designing for Mobility - Spot The Dangerous Criminal
Designing for Mobility - Spot The Dangerous CriminalDesigning for Mobility - Spot The Dangerous Criminal
Designing for Mobility - Spot The Dangerous Criminal
 
Capstone project the battle of neighborhoods presentation
Capstone project the battle of neighborhoods presentationCapstone project the battle of neighborhoods presentation
Capstone project the battle of neighborhoods presentation
 
Interconnection in Regional Markets
Interconnection in Regional MarketsInterconnection in Regional Markets
Interconnection in Regional Markets
 
Revenue Growth through Machine Learning
Revenue Growth through Machine LearningRevenue Growth through Machine Learning
Revenue Growth through Machine Learning
 
Summit EU Machine Learning
Summit EU  Machine LearningSummit EU  Machine Learning
Summit EU Machine Learning
 
Connections Drive Digital Transformation
Connections Drive Digital TransformationConnections Drive Digital Transformation
Connections Drive Digital Transformation
 

Recently uploaded

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 

Battle of the neighbourhoods toronto yahuza suya

  • 1. ChoosingThe Best Neighbourhood for Nigerian Barbeque Business inToronto
  • 2. The Problem • Yahuza Suya wants to open a new branch of their Nigerian-style barbeque chicken and beef in Toronto. • They want to eliminate the likelihood of failure as much as possible. • They are clear about their factors for success • 1. Safety and Security • 2. Income within pricing market • 3. Close to transportation stations- due to immigrant community presence
  • 3. My proposed solution • 1. Get the data on the various neighbourhoods • 2. Compare them against others • 3. Pick out the neighbourhoods that meet this criteria most • 4. Pick out the neighbourhood that satisfies these conditions and is supported by research
  • 4. Data • 1. Crime Data • 2. Income Data • 3. Transportation station data • 4.Toronto postal codes, neighbourhoods and coordinates data • 5. Foursquare Data
  • 5. Tools • 1. Dataframes • 2. Maps • 3. Foursquare API • 4. CSV Tables
  • 6. Python Libraries,Functions and Techniques Used • 1 NumPy for computation • 2. Folium for building maps • 3. Pandas for handling dataframes • 4. BeautifulSoup for scraping websites for data • 5. Requests for getting results • 6. Geopy for handling location data • 7. sklearn for k-means clustering
  • 7. Sources of Data • Sources • General Toronto data was obtained from • https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M • • The demographic data was obtained from https://en.wikipedia.org/wiki/Demographics_of_Toronto_neighbourhoods • • The crime data was obtained from http://data.torontopolice.on.ca/datasets?q=crime • • The list of Metro stations was obtained from https://en.wikipedia.org/wiki/List_of_Toronto_subway_stations
  • 8. Methodology • We first import all necessary libraries. We ignore libraries like folium and sklearn until we need them so that it does not slow down the processing • Neigbourhood Data- Scrape • https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M. • The resulting dataframe:
  • 9.
  • 10. I clean up, drop non-assigned cells and rename cells to ensure consistency . The resulting dataframe is:
  • 11. Income Data: We scrape 'https://en.wikipedia.org/wiki/Demographics_of_Toronto_neighbourhoods’ After cleaning and sorting we have :
  • 12. Crime Data We import csv . Originally extracted from data.torontopolice.ca, it was cleaned and sorted then uploaded On a website . Resulting Dataframe:
  • 13. Visualization of Toronto and Toronto Metro Stations
  • 14. USING FOURSQUARE, WE OBTAIN VENUE RESULTS AND PERFORM K-MEANS Cluster visualisation below where k=5
  • 15. A snapshot of cluster 3 which is our where our preferred neighbourhood is likely to come from:
  • 16. BY narrowing down on the neighbourhoods which satisfy our conditions and also fit the likely clusters, we came to: Lawrence Park Rosedale Parkwoods Willowdale West Riverdale West Mimico On further observation,, we narrow down to three and have more specific results, I look closely and see that: 1.Lawrence Parksatisfies all the three requirements and is close to a college. This will attract young adults who have an active nightlife. 2. Willowdale Westis also a good idea as it satisfies the requirements. 3. The third option will be Parkwoods, which also satisfies the requirement of safety and income. Further search reveals there are many immigrants.
  • 17. CONCLUSIONS Choosing a good neighbourhood usually depends on more than one factor. In most cases, it is usually a healthy combination of important factors. For this task, while safety and pricing were major factors, it would have been counter productive to choose based on these two factors alone. The use of foursquare data and the venues provided gave us a more descriptive view of these neighbourhoods and we are able to make a more robust decision. Such tasks show the important of Data Science as it enables fact-based decision-making, even if it means coming to the same assumption.
  • 18. Without Data, You’re just another person with an opinion • - w. Edwards deming