SlideShare a Scribd company logo
Coursera Capstone project
Coursera IBM Data ScienceCertification
Ajay Kumar
( Dec 2019)
Report Content
1. Introduction Section :
⁃ The “business problem” to be solved by this project and who may be interested
2. Data Section:
⁃ Describe Data requirements and Sources needed to solve the problem
3. Methodology section:
⁃ Main component of the report - Execute data processing, describe/discuss any
exploratory data analysis and/or inferential statistical testing performed, and/or
machine learnings used.
4. Results section:
⁃ Discussion of the results and finding of answer
5. Discussion section:
⁃ Discussion of observations noted and any recommendations
6. Conclusion section:
⁃ Answer chosen and conclusions.
1.0 Introduction
1.1 Scenario and Background
I am currently living in Singapore, within walking distance to Downtown "Telok Ayer
MRT metro station" . I also enjoy great venues and attractions, such as international
cuisine, entertainment and shopping. I have an offer to move to work to Manhattan NY
and I would like to move if I can find a place to live similar with similar venues.
2. Problem to be resolved:
How to find an apartment in Manhattan with the following conditions:
• Apartment with min 2 bedrooms
• Monthly rent not to exceed US$7000/month
• Located within walking distance (<=1.0 mile, 1.6 km) from a subway metro station in
Manhattan
• Venues and amenities as in my current residence.
3. Interested Audience
I believe the methodology, tools and strategy used in this project is relevant for a person
or entity considering moving to a major city in US, Europe or Asia. Europe, US or Asia,
Likewise, it can be helpful approach to explore the opening of a new business. The use
of FourSquare data and mapping techniques combined with data analysis will help
resolve the key questions arisen. Lastly, this project is a good practical case for a person
developing Data Science skills.
2.0 Data Section
2.1 Data Requirements
- Geodata for current residence in Singapore with venues established using Foursquare.
-List of Manhattan (MH) neighborhoods with clustered venues established via Foursquare (as in Course
Lab). https://en.wikipedia.org/wiki/List_of_Manhattan_neighborhoods#Midtown_neighborhoods
-List of subway metro stations in Manhattan with addresses and geo data (lat,long): https://
en.wikipedia.org/wiki/List_of_New_York_City_Subway_stations_in_Manhattan) , (https://www.google.com/
maps/search/manhattan+subway+metro+stations/@40.7837297,-74.1033043,11z/data=!3m1!4b1)
-List of apartments for rent in Manhattan area with information on neighborhood location, address,
number of beds, area size, monthly rent price and complemented with geo data via Nominatim. http://
www.rentmanhattan.com/index.cfm?page=search&state=results https://www.nestpick.com/search?
city=new-
- Place to work in Manhattan (Park Avenue and 53rd St) for reference
2.2 Data Sources, Data Processing and Tools used
- Singapore data and map is to be created with use of Nominatim , Foursquare and Folium mapping
-Manhattan neighborhoods were obtained from Wikipedia and organized by Neighborhoods with geodata
via Nominatim for mapping with Folium.
- List of Subway stations was obtained via Wikipedia, NY Transit web site and Google map,
-List of apartments for rent was consolidated from web-scraping real estate sites for MH. The geolocation
(lat,long) data was found with algorithm coding and using Nominatim.
-Folium map was the basis of mapping with various features to consolidate all data in ONE map where
one can visualize all details needed to make a selection of apartment
3.0 Methodology
The Strategy to find the answer:
The strategy is based on mapping the described data in section 2.0, in order to
facilitate the choice of at least two candidate places for rent. The information will be
consolidated in ONE MAP where one can see the details of the apartment, the cluster
of venues in the neighborhood and the relative location from a subway station and
from work place. A measurement tool icon will also be provided. The popups on the
map items will display rent price, location and cluster of venues applicable.
The Tools:
Web-scraping of sites is used to consolidate data-frame information which was
saved as csv files for convenience and to simply the report. Geodata was obtained
by coding a program to use Nominatim to get latitude and longitude of subway
stations and also for each of (144 units) the apartments for rent listed.
Geopy_distance and Nominatim were used to establish relative distances. Seaborn
graphic was used for general statistics on rental data.
Maps with popups labels allow quick identification of location, price and feature, thus
making the selection very easy
4.0 Execution and Results
Current residence Neighborhood in Singapore
Venues around Neighborhood
in
Manhattan Map - Neighborhoods and Cluster of
Venues
GeoData Manhattan apts for rent
Rental Price Statistics MH Apartments
Budget US7000/month is around the mean
Apartments for Rent in
MH
MH apts for rent with venue clusters
Venues of cluster 3
Manhattan subway stations geodata
Apts for rent (blue) and subway stations (red)
Selected Apartment!
The ONE consolidated map shows all information for decision:
Apartments address, price, neighborhood, cluster of venues and subway station nearby.
Blue dots=apts , Red dots=Subway station, Bubbles=Cluster of Venues
Apartment Selection
Using the "one map" above, I was able to explore all possibilities since the popups
provide the information needed for a good decision.
Apartment 1 rent cost is US7500 slightly above the US7000 budget. Apt 1 is located
400 meters from subway station at 59th Street and work place (Park Ave and 53rd) is
another 600 meters way. I can walk to work place and use subway for other places
around. Venues for this apt are as of Cluster 2 and it is located in a fine district in the
East side of Manhattan.
Apartment 2 rent cost is US6935, just under the US7000 budget. Apt 2 is located 60
meters from subway station at Fulton Street, but I will have to ride the subway daily
to work , possibly 40-60 min ride. Venues for this apt are as of Cluster 3.¶
Based on current Singapore venues, I feel that Cluster 2 type of venues is a closer
resemblance to my current place. That means that APARTMENT 1 is a better choice
since the extra monthly rent is worth the conveniences it provides.
I will walk to work
Walk from home to work is less than 1 km!
Venus in Cluster 2 near future home
5.0 Discussion
In general, I ampositively impressed with the overall
organization, content andlabworks presented during
the Coursera IBM Certification Course
I feel this Capstone project presented meagreat
opportunity to practice andapplythe Data Science
tools andmethodologieslearned.
I havecreated agood project that I canpresent asan
example to show my potential.
I feel I haveacquired agood starting point to become a
professional Data Scientist andI will continue exploring
to creating examples of practical cases.
6.0 Conclusions
• I feel rewarded with the efforts, time andmoney spent. I
believe this course with all the topics covered is well worthy
of appreciation.
• This project hasshown meapractical application to resolve
areal situation that hasimpacting personal andfinancial
impact usingData Sciencetools.
• The mappingwith Folium is avery powerful technique to
consolidate information andmakethe analysisanddecision
thoroughly andwith confidence. I would recommend for
usein similarsituations.
• One must keepabreast of new tools for DSthat continue
to appearfor application in severalbusiness fields.
This Photo by Unknown author is licensed under CC BY-NC-ND.

More Related Content

What's hot

Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detection
vineeta vineeta
 
paytm
paytmpaytm
Three case studies deploying cluster analysis
Three case studies deploying cluster analysisThree case studies deploying cluster analysis
Three case studies deploying cluster analysis
Greg Makowski
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending Business
Sanjay Kar
 
Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detection
kalpesh1908
 
Iris data analysis example in R
Iris data analysis example in RIris data analysis example in R
Iris data analysis example in R
Duyen Do
 
National Retail Payment System and Proposed Payment Systems Act
National Retail Payment System and Proposed Payment Systems ActNational Retail Payment System and Proposed Payment Systems Act
National Retail Payment System and Proposed Payment Systems Act
Janette Toral
 
Detecting fraud with Python and machine learning
Detecting fraud with Python and machine learningDetecting fraud with Python and machine learning
Detecting fraud with Python and machine learning
wgyn
 
The Future of Big Tech
The Future of Big TechThe Future of Big Tech
The Future of Big Tech
Freddy Kelly
 
Credit scorecard
Credit scorecardCredit scorecard
Credit scorecard
Tuhin AI Advisory
 
Housing price prediction
Housing price predictionHousing price prediction
Housing price prediction
Abhimanyu Dwivedi
 
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
Neo4j
 
Supply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AISupply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AI
TigerGraph
 
Credit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In DatabricksCredit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In Databricks
Databricks
 
PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014
Sri Ambati
 
Anomaly Detection for Real-World Systems
Anomaly Detection for Real-World SystemsAnomaly Detection for Real-World Systems
Anomaly Detection for Real-World Systems
Manojit Nandi
 
KPMG Virtual Internship Task3.pptx
KPMG Virtual Internship Task3.pptxKPMG Virtual Internship Task3.pptx
KPMG Virtual Internship Task3.pptx
VIDHIYA S B
 
WSO2 Open Banking: Digital Transformation Through PSD2
WSO2 Open Banking: Digital Transformation Through PSD2WSO2 Open Banking: Digital Transformation Through PSD2
WSO2 Open Banking: Digital Transformation Through PSD2
WSO2
 
Credit card fraud detection through machine learning
Credit card fraud detection through machine learningCredit card fraud detection through machine learning
Credit card fraud detection through machine learning
dataalcott
 
An Introduction to RFM in Analytics
An Introduction to RFM in AnalyticsAn Introduction to RFM in Analytics
An Introduction to RFM in Analytics
SAS Canada
 

What's hot (20)

Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detection
 
paytm
paytmpaytm
paytm
 
Three case studies deploying cluster analysis
Three case studies deploying cluster analysisThree case studies deploying cluster analysis
Three case studies deploying cluster analysis
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending Business
 
Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detection
 
Iris data analysis example in R
Iris data analysis example in RIris data analysis example in R
Iris data analysis example in R
 
National Retail Payment System and Proposed Payment Systems Act
National Retail Payment System and Proposed Payment Systems ActNational Retail Payment System and Proposed Payment Systems Act
National Retail Payment System and Proposed Payment Systems Act
 
Detecting fraud with Python and machine learning
Detecting fraud with Python and machine learningDetecting fraud with Python and machine learning
Detecting fraud with Python and machine learning
 
The Future of Big Tech
The Future of Big TechThe Future of Big Tech
The Future of Big Tech
 
Credit scorecard
Credit scorecardCredit scorecard
Credit scorecard
 
Housing price prediction
Housing price predictionHousing price prediction
Housing price prediction
 
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
Training Series - Build A Routing Web Application With OpenStreetMap, Neo4j, ...
 
Supply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AISupply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AI
 
Credit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In DatabricksCredit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In Databricks
 
PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014
 
Anomaly Detection for Real-World Systems
Anomaly Detection for Real-World SystemsAnomaly Detection for Real-World Systems
Anomaly Detection for Real-World Systems
 
KPMG Virtual Internship Task3.pptx
KPMG Virtual Internship Task3.pptxKPMG Virtual Internship Task3.pptx
KPMG Virtual Internship Task3.pptx
 
WSO2 Open Banking: Digital Transformation Through PSD2
WSO2 Open Banking: Digital Transformation Through PSD2WSO2 Open Banking: Digital Transformation Through PSD2
WSO2 Open Banking: Digital Transformation Through PSD2
 
Credit card fraud detection through machine learning
Credit card fraud detection through machine learningCredit card fraud detection through machine learning
Credit card fraud detection through machine learning
 
An Introduction to RFM in Analytics
An Introduction to RFM in AnalyticsAn Introduction to RFM in Analytics
An Introduction to RFM in Analytics
 

Similar to Coursera capstone report ajay kumar ppt

Capstone project the battle of neighborhoods
Capstone project   the battle of neighborhoodsCapstone project   the battle of neighborhoods
Capstone project the battle of neighborhoods
Dr Qaim Mehdi
 
MANUSCRIPT.docx
MANUSCRIPT.docxMANUSCRIPT.docx
MANUSCRIPT.docx
AlbaracinKingEzekiel
 
IRJET- Face Counter using Matlab
IRJET-  	  Face Counter using MatlabIRJET-  	  Face Counter using Matlab
IRJET- Face Counter using Matlab
IRJET Journal
 
IRJET- Popularity based Recommender Sytsem for Google Maps
IRJET-  	  Popularity based Recommender Sytsem for Google MapsIRJET-  	  Popularity based Recommender Sytsem for Google Maps
IRJET- Popularity based Recommender Sytsem for Google Maps
IRJET Journal
 
bhagat.pdf
bhagat.pdfbhagat.pdf
bhagat.pdf
Ayesha Lata
 
SMAC
SMACSMAC
SMAC
Mphasis
 
IRJET- Career Counselling Chatbot
IRJET-  	  Career Counselling ChatbotIRJET-  	  Career Counselling Chatbot
IRJET- Career Counselling Chatbot
IRJET Journal
 
Travelogue: A Travel Package Recommendation using Python
Travelogue: A Travel Package Recommendation using PythonTravelogue: A Travel Package Recommendation using Python
Travelogue: A Travel Package Recommendation using Python
IRJET Journal
 
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
rahulmonikasharma
 
Seminar and Project Manager and Resourceful Trainer(SMART)
Seminar and Project Manager and Resourceful Trainer(SMART)Seminar and Project Manager and Resourceful Trainer(SMART)
Seminar and Project Manager and Resourceful Trainer(SMART)
IOSR Journals
 
Internet-Based Geographical Information Systems for the Real Estate Marketing
Internet-Based Geographical Information Systems for the Real Estate MarketingInternet-Based Geographical Information Systems for the Real Estate Marketing
Internet-Based Geographical Information Systems for the Real Estate Marketing
iosrjce
 
H017235155
H017235155H017235155
H017235155
IOSR Journals
 
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docxRunning Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
toltonkendal
 
Samuel_Wayfinding_Metropolia_2015
Samuel_Wayfinding_Metropolia_2015Samuel_Wayfinding_Metropolia_2015
Samuel_Wayfinding_Metropolia_2015
dbelgi
 
My Places
My PlacesMy Places
My Places
IRJET Journal
 
REAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTIONREAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTION
IRJET Journal
 
Suche Da Carte
Suche Da CarteSuche Da Carte
Suche Da Carte
Poumita Das
 
A Survey of Building Robust Business Models in Pervasive Computing
A Survey of Building Robust Business Models in Pervasive ComputingA Survey of Building Robust Business Models in Pervasive Computing
A Survey of Building Robust Business Models in Pervasive Computing
Osama M. Khaled
 
IRJET- University Campus Event Navigation System
IRJET-  	  University Campus Event Navigation System   IRJET-  	  University Campus Event Navigation System
IRJET- University Campus Event Navigation System
IRJET Journal
 
location based service
location based servicelocation based service
location based service
Dark Side
 

Similar to Coursera capstone report ajay kumar ppt (20)

Capstone project the battle of neighborhoods
Capstone project   the battle of neighborhoodsCapstone project   the battle of neighborhoods
Capstone project the battle of neighborhoods
 
MANUSCRIPT.docx
MANUSCRIPT.docxMANUSCRIPT.docx
MANUSCRIPT.docx
 
IRJET- Face Counter using Matlab
IRJET-  	  Face Counter using MatlabIRJET-  	  Face Counter using Matlab
IRJET- Face Counter using Matlab
 
IRJET- Popularity based Recommender Sytsem for Google Maps
IRJET-  	  Popularity based Recommender Sytsem for Google MapsIRJET-  	  Popularity based Recommender Sytsem for Google Maps
IRJET- Popularity based Recommender Sytsem for Google Maps
 
bhagat.pdf
bhagat.pdfbhagat.pdf
bhagat.pdf
 
SMAC
SMACSMAC
SMAC
 
IRJET- Career Counselling Chatbot
IRJET-  	  Career Counselling ChatbotIRJET-  	  Career Counselling Chatbot
IRJET- Career Counselling Chatbot
 
Travelogue: A Travel Package Recommendation using Python
Travelogue: A Travel Package Recommendation using PythonTravelogue: A Travel Package Recommendation using Python
Travelogue: A Travel Package Recommendation using Python
 
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
Prediction Techniques in Internet of Things (IoT) Environment: A Comparative ...
 
Seminar and Project Manager and Resourceful Trainer(SMART)
Seminar and Project Manager and Resourceful Trainer(SMART)Seminar and Project Manager and Resourceful Trainer(SMART)
Seminar and Project Manager and Resourceful Trainer(SMART)
 
Internet-Based Geographical Information Systems for the Real Estate Marketing
Internet-Based Geographical Information Systems for the Real Estate MarketingInternet-Based Geographical Information Systems for the Real Estate Marketing
Internet-Based Geographical Information Systems for the Real Estate Marketing
 
H017235155
H017235155H017235155
H017235155
 
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docxRunning Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
Running Head TOWN GUIDE ANDROID APPLICATION5TOWN GUIDE ANDROI.docx
 
Samuel_Wayfinding_Metropolia_2015
Samuel_Wayfinding_Metropolia_2015Samuel_Wayfinding_Metropolia_2015
Samuel_Wayfinding_Metropolia_2015
 
My Places
My PlacesMy Places
My Places
 
REAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTIONREAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTION
 
Suche Da Carte
Suche Da CarteSuche Da Carte
Suche Da Carte
 
A Survey of Building Robust Business Models in Pervasive Computing
A Survey of Building Robust Business Models in Pervasive ComputingA Survey of Building Robust Business Models in Pervasive Computing
A Survey of Building Robust Business Models in Pervasive Computing
 
IRJET- University Campus Event Navigation System
IRJET-  	  University Campus Event Navigation System   IRJET-  	  University Campus Event Navigation System
IRJET- University Campus Event Navigation System
 
location based service
location based servicelocation based service
location based service
 

Recently uploaded

zOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL DifferenceszOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL Differences
YousufSait3
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
brainerhub1
 
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
mz5nrf0n
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Łukasz Chruściel
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
UI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design SystemUI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design System
Peter Muessig
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian CompaniesE-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
Quickdice ERP
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
SOCRadar
 
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
ssuserad3af4
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
Philip Schwarz
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
mz5nrf0n
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
Peter Muessig
 
Top 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptxTop 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptx
devvsandy
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
Green Software Development
 
Microservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we workMicroservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we work
Sven Peters
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
Łukasz Chruściel
 
SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
Yara Milbes
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
Hornet Dynamics
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
ToXSL Technologies
 

Recently uploaded (20)

zOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL DifferenceszOS Mainframe JES2-JES3 JCL-JECL Differences
zOS Mainframe JES2-JES3 JCL-JECL Differences
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
 
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
UI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design SystemUI5con 2024 - Bring Your Own Design System
UI5con 2024 - Bring Your Own Design System
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian CompaniesE-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
E-Invoicing Implementation: A Step-by-Step Guide for Saudi Arabian Companies
 
socradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdfsocradar-q1-2024-aviation-industry-report.pdf
socradar-q1-2024-aviation-industry-report.pdf
 
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
316895207-SAP-Oil-and-Gas-Downstream-Training.pptx
 
Hand Rolled Applicative User Validation Code Kata
Hand Rolled Applicative User ValidationCode KataHand Rolled Applicative User ValidationCode Kata
Hand Rolled Applicative User Validation Code Kata
 
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
原版定制美国纽约州立大学奥尔巴尼分校毕业证学位证书原版一模一样
 
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling ExtensionsUI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions
 
Top 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptxTop 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptx
 
GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
 
Microservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we workMicroservice Teams - How the cloud changes the way we work
Microservice Teams - How the cloud changes the way we work
 
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf2024 eCommerceDays Toulouse - Sylius 2.0.pdf
2024 eCommerceDays Toulouse - Sylius 2.0.pdf
 
SMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API ServiceSMS API Integration in Saudi Arabia| Best SMS API Service
SMS API Integration in Saudi Arabia| Best SMS API Service
 
E-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet DynamicsE-commerce Development Services- Hornet Dynamics
E-commerce Development Services- Hornet Dynamics
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
 

Coursera capstone report ajay kumar ppt

  • 1. Coursera Capstone project Coursera IBM Data ScienceCertification Ajay Kumar ( Dec 2019)
  • 2. Report Content 1. Introduction Section : ⁃ The “business problem” to be solved by this project and who may be interested 2. Data Section: ⁃ Describe Data requirements and Sources needed to solve the problem 3. Methodology section: ⁃ Main component of the report - Execute data processing, describe/discuss any exploratory data analysis and/or inferential statistical testing performed, and/or machine learnings used. 4. Results section: ⁃ Discussion of the results and finding of answer 5. Discussion section: ⁃ Discussion of observations noted and any recommendations 6. Conclusion section: ⁃ Answer chosen and conclusions.
  • 3. 1.0 Introduction 1.1 Scenario and Background I am currently living in Singapore, within walking distance to Downtown "Telok Ayer MRT metro station" . I also enjoy great venues and attractions, such as international cuisine, entertainment and shopping. I have an offer to move to work to Manhattan NY and I would like to move if I can find a place to live similar with similar venues. 2. Problem to be resolved: How to find an apartment in Manhattan with the following conditions: • Apartment with min 2 bedrooms • Monthly rent not to exceed US$7000/month • Located within walking distance (<=1.0 mile, 1.6 km) from a subway metro station in Manhattan • Venues and amenities as in my current residence. 3. Interested Audience I believe the methodology, tools and strategy used in this project is relevant for a person or entity considering moving to a major city in US, Europe or Asia. Europe, US or Asia, Likewise, it can be helpful approach to explore the opening of a new business. The use of FourSquare data and mapping techniques combined with data analysis will help resolve the key questions arisen. Lastly, this project is a good practical case for a person developing Data Science skills.
  • 4. 2.0 Data Section 2.1 Data Requirements - Geodata for current residence in Singapore with venues established using Foursquare. -List of Manhattan (MH) neighborhoods with clustered venues established via Foursquare (as in Course Lab). https://en.wikipedia.org/wiki/List_of_Manhattan_neighborhoods#Midtown_neighborhoods -List of subway metro stations in Manhattan with addresses and geo data (lat,long): https:// en.wikipedia.org/wiki/List_of_New_York_City_Subway_stations_in_Manhattan) , (https://www.google.com/ maps/search/manhattan+subway+metro+stations/@40.7837297,-74.1033043,11z/data=!3m1!4b1) -List of apartments for rent in Manhattan area with information on neighborhood location, address, number of beds, area size, monthly rent price and complemented with geo data via Nominatim. http:// www.rentmanhattan.com/index.cfm?page=search&state=results https://www.nestpick.com/search? city=new- - Place to work in Manhattan (Park Avenue and 53rd St) for reference 2.2 Data Sources, Data Processing and Tools used - Singapore data and map is to be created with use of Nominatim , Foursquare and Folium mapping -Manhattan neighborhoods were obtained from Wikipedia and organized by Neighborhoods with geodata via Nominatim for mapping with Folium. - List of Subway stations was obtained via Wikipedia, NY Transit web site and Google map, -List of apartments for rent was consolidated from web-scraping real estate sites for MH. The geolocation (lat,long) data was found with algorithm coding and using Nominatim. -Folium map was the basis of mapping with various features to consolidate all data in ONE map where one can visualize all details needed to make a selection of apartment
  • 5. 3.0 Methodology The Strategy to find the answer: The strategy is based on mapping the described data in section 2.0, in order to facilitate the choice of at least two candidate places for rent. The information will be consolidated in ONE MAP where one can see the details of the apartment, the cluster of venues in the neighborhood and the relative location from a subway station and from work place. A measurement tool icon will also be provided. The popups on the map items will display rent price, location and cluster of venues applicable. The Tools: Web-scraping of sites is used to consolidate data-frame information which was saved as csv files for convenience and to simply the report. Geodata was obtained by coding a program to use Nominatim to get latitude and longitude of subway stations and also for each of (144 units) the apartments for rent listed. Geopy_distance and Nominatim were used to establish relative distances. Seaborn graphic was used for general statistics on rental data. Maps with popups labels allow quick identification of location, price and feature, thus making the selection very easy
  • 9. Manhattan Map - Neighborhoods and Cluster of Venues
  • 11. Rental Price Statistics MH Apartments Budget US7000/month is around the mean
  • 13. MH apts for rent with venue clusters
  • 16. Apts for rent (blue) and subway stations (red)
  • 17. Selected Apartment! The ONE consolidated map shows all information for decision: Apartments address, price, neighborhood, cluster of venues and subway station nearby. Blue dots=apts , Red dots=Subway station, Bubbles=Cluster of Venues
  • 18. Apartment Selection Using the "one map" above, I was able to explore all possibilities since the popups provide the information needed for a good decision. Apartment 1 rent cost is US7500 slightly above the US7000 budget. Apt 1 is located 400 meters from subway station at 59th Street and work place (Park Ave and 53rd) is another 600 meters way. I can walk to work place and use subway for other places around. Venues for this apt are as of Cluster 2 and it is located in a fine district in the East side of Manhattan. Apartment 2 rent cost is US6935, just under the US7000 budget. Apt 2 is located 60 meters from subway station at Fulton Street, but I will have to ride the subway daily to work , possibly 40-60 min ride. Venues for this apt are as of Cluster 3.¶ Based on current Singapore venues, I feel that Cluster 2 type of venues is a closer resemblance to my current place. That means that APARTMENT 1 is a better choice since the extra monthly rent is worth the conveniences it provides.
  • 19. I will walk to work Walk from home to work is less than 1 km!
  • 20. Venus in Cluster 2 near future home
  • 21. 5.0 Discussion In general, I ampositively impressed with the overall organization, content andlabworks presented during the Coursera IBM Certification Course I feel this Capstone project presented meagreat opportunity to practice andapplythe Data Science tools andmethodologieslearned. I havecreated agood project that I canpresent asan example to show my potential. I feel I haveacquired agood starting point to become a professional Data Scientist andI will continue exploring to creating examples of practical cases.
  • 22. 6.0 Conclusions • I feel rewarded with the efforts, time andmoney spent. I believe this course with all the topics covered is well worthy of appreciation. • This project hasshown meapractical application to resolve areal situation that hasimpacting personal andfinancial impact usingData Sciencetools. • The mappingwith Folium is avery powerful technique to consolidate information andmakethe analysisanddecision thoroughly andwith confidence. I would recommend for usein similarsituations. • One must keepabreast of new tools for DSthat continue to appearfor application in severalbusiness fields.
  • 23. This Photo by Unknown author is licensed under CC BY-NC-ND.