PredictionIO
Adam Wang
Annie Cheng
Brett Clancy
Erin Chen
Han-Wen Chen
Siyuan Wang
Yiting Wang
Koober
Meet Tim
Tim’s Challenge
Provide an efficient service for customers and taxi
drivers by…
Wait Times + Revenue
Tim has some questions
How should taxis be allocated
among different locations?
What factors drive customers
to use taxis more?
But most importantly…
How can the taxi industry utilize past data
to predict the future needs of their riders?
Current Solutions
iTaxi
Shiny
TaxiPrediction
Our Solution
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Why Koober?
Demand Analysis + Prediction. Empower taxi
owners with valuable rider information backed by
past trends.
Anyone in Ride Sharing Industry. Plan more
appropriate routes and make better transportation
decisions.
So how does it work?
1. Choose your info need: analysis or prediction.
2. Adjust your input query parameters.
3. Visualize taxi demand on the heat maps.
4. Make more informed taxi resource decisions.
Let’s try it out!
https://koober-dashboard.herokuapp.com/
How does it really work?
Train and test machine learning models.
Display demand on interactive heat maps.
What have we done?
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Raw Data
NYC Taxi & Limousine Commission:
Yellow Taxi Cab Dataset
Weather Underground API:
• Historical Weather Data
• Current Conditions
• 10-Day Forecast
Task 1
Demo Data Loader
Task 1
Development
• Generate fake traffic data based on user-defined parameters
• Simplify debugging and validating the model prediction result
Production
• Process NYC Taxi Data and integrate with the historical weather data
• Incorporate Kafka to facilitate importing large amounts of training data
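A minimal sketch of what the development loader describes: fake pickup events generated from user-defined parameters. The function name `generate_fake_pickups`, its parameters, and the center coordinates are hypothetical choices for illustration, not the project's actual loader.

```python
import random
from datetime import datetime, timedelta, timezone


def generate_fake_pickups(count, center_lat, center_lng, spread_deg, start, hours, seed=0):
    """Return `count` synthetic pickup events scattered around a center point.

    All parameters are illustrative; the real loader's options are not
    described in the slides.
    """
    rng = random.Random(seed)  # fixed seed keeps debugging runs reproducible
    events = []
    for _ in range(count):
        # Pick a random moment inside the requested time window.
        offset = timedelta(seconds=rng.uniform(0, hours * 3600))
        events.append({
            "eventTime": (start + offset).isoformat(timespec="milliseconds"),
            # Gaussian scatter around the center, in degrees.
            "lat": rng.gauss(center_lat, spread_deg),
            "lng": rng.gauss(center_lng, spread_deg),
        })
    return events


EST = timezone(timedelta(hours=-5))
fake = generate_fake_pickups(
    count=100,
    center_lat=40.71,   # arbitrary illustrative center
    center_lng=-74.00,
    spread_deg=0.01,
    start=datetime(2017, 1, 20, 0, 0, tzinfo=EST),
    hours=24,
)
```

Because the seed is fixed, two runs with the same parameters produce the same events, which is what makes a generator like this useful for validating model predictions.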
Feature Extraction
Task 1
{
   "eventTime": "2017-01-20T18:54:07.000-05:00",
   "lat": 40.713802337646484,
   "lng": -77.0088882446289,
   "temperature": 0,
   "clear": 1,
   "fog": 0,
   "rain": 0,
   "snow": 0,
   "hail": 0,
   "thunder": 0,
   "tornado": 0
}
Weather
Location
Event Time
Temperature
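The event above could be assembled roughly as follows. This is a sketch under assumptions: the helper `to_event` is hypothetical, and it assumes each weather field is a 0/1 indicator, which is what the sample suggests but the slide does not state.

```python
import json
from datetime import datetime, timedelta, timezone

# Weather indicator fields, in the order they appear in the sample event.
WEATHER_FLAGS = ("clear", "fog", "rain", "snow", "hail", "thunder", "tornado")


def to_event(pickup_time, lat, lng, temperature, conditions):
    """Build one feature record from a pickup and the weather at that time.

    `conditions` is a set of active weather flags (assumed 0/1 encoding).
    """
    unknown = set(conditions) - set(WEATHER_FLAGS)
    if unknown:
        raise ValueError(f"unknown weather conditions: {sorted(unknown)}")
    event = {
        "eventTime": pickup_time.isoformat(timespec="milliseconds"),
        "lat": lat,
        "lng": lng,
        "temperature": temperature,
    }
    for flag in WEATHER_FLAGS:
        event[flag] = 1 if flag in conditions else 0
    return event


EST = timezone(timedelta(hours=-5))
event = to_event(
    datetime(2017, 1, 20, 18, 54, 7, tzinfo=EST),
    40.713802337646484,
    -77.0088882446289,
    0,
    {"clear"},
)
print(json.dumps(event, indent=3))
```

The call reuses the values from the sample event, so the printed record has the same fields in the same order.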
Location Clustering
Preparator:
• K-Means Location Clustering
(200 clusters)
• Each cluster represents a
neighborhood geographically
• Demand Aggregation per Unit
Time Interval and Location
Cluster
Task 1
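The two preparator steps can be sketched in plain Python: K-Means over pickup coordinates, then a demand count per time interval and cluster. This is a hand-rolled Lloyd's algorithm on toy data with 2 clusters instead of 200; the function names are hypothetical, one hour is assumed as the unit time interval, and squared distance on raw degrees is a simplification.

```python
import random
from collections import Counter


def nearest(point, centers):
    """Index of the center closest to `point` (squared Euclidean distance)."""
    return min(
        range(len(centers)),
        key=lambda i: (point[0] - centers[i][0]) ** 2 + (point[1] - centers[i][1]) ** 2,
    )


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's algorithm; returns k cluster centers as (lat, lng)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Move each center to the mean of its group; keep it if the group is empty.
        centers = [
            (sum(lat for lat, _ in g) / len(g), sum(lng for _, lng in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
    return centers


def aggregate_demand(pickups, centers):
    """Count pickups per (hour, cluster index)."""
    counts = Counter()
    for hour, lat, lng in pickups:
        counts[(hour, nearest((lat, lng), centers))] += 1
    return counts


# Toy pickups as (hour, lat, lng): two nearby points, then three in another area.
pickups = [
    (8, 40.700, -74.010), (8, 40.701, -74.011),
    (8, 40.800, -73.950), (9, 40.801, -73.951), (9, 40.799, -73.949),
]
centers = kmeans([(lat, lng) for _, lat, lng in pickups], k=2)
demand = aggregate_demand(pickups, centers)
```

Each key in `demand` pairs an hour with a cluster index, which is the kind of aggregated target a regression model can then be trained on.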
Models
Task 2
Multiple Models. Supports many classic individual
and combined machine learning models.
• Gradient-Boosted Trees
• Linear Regression with Stochastic Gradient Descent
• Neural Network
• Random Forest
• Ridge Regression
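To make one of the listed models concrete, here is a minimal sketch of linear regression trained with stochastic gradient descent. It is a hand-written illustration on noiseless toy data, not the implementation the project uses; the function names, learning rate, and epoch count are assumptions.

```python
import random


def sgd_linear_regression(samples, learning_rate=0.1, epochs=300, seed=0):
    """Fit y ≈ w·x + b by updating on one sample at a time.

    `samples` is a list of (features, target) pairs.
    """
    rng = random.Random(seed)
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    order = list(range(len(samples)))
    for _ in range(epochs):
        rng.shuffle(order)  # visit samples in a fresh random order each epoch
        for i in order:
            x, y = samples[i]
            error = sum(w * xi for w, xi in zip(weights, x)) + bias - y
            # Gradient step for squared error on this single sample.
            weights = [w - learning_rate * error * xi for w, xi in zip(weights, x)]
            bias -= learning_rate * error
    return weights, bias


def predict(model, x):
    weights, bias = model
    return sum(w * xi for w, xi in zip(weights, x)) + bias


# Toy data: demand = 3 * feature + 2, with the feature scaled to [0, 1].
toy = [([i / 19], 3 * (i / 19) + 2) for i in range(20)]
model = sgd_linear_regression(toy)
```

Scaling features to a small range, as the toy data does, keeps a fixed learning rate stable; unscaled inputs such as raw temperatures would likely need a smaller step.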
Task 2
PredictionIO - Train
Task 2
PredictionIO - Predict
Map Visualization
Task 4
[Architecture diagram: the User Interface supplies Time, Location, and Weather; the Web App sends a Predict Query to the PredictionIO Engine and receives a Predict Result for Each Algorithm; a Map GL component renders the result with Mapbox.]
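One plausible way to hand prediction results to a map component is as GeoJSON points. The sketch below assumes a result shape (`lat`, `lng`, `demand`, `algorithm`) that is hypothetical; the slides do not show the actual response format.

```python
import json


def predictions_to_geojson(predictions):
    """Convert prediction dicts into a GeoJSON FeatureCollection of points.

    The input keys are assumed for illustration only.
    """
    features = [
        {
            "type": "Feature",
            # GeoJSON positions are [longitude, latitude], in that order.
            "geometry": {"type": "Point", "coordinates": [p["lng"], p["lat"]]},
            "properties": {"demand": p["demand"], "algorithm": p["algorithm"]},
        }
        for p in predictions
    ]
    return {"type": "FeatureCollection", "features": features}


sample_predictions = [
    {"lat": 40.71, "lng": -74.00, "demand": 12.5, "algorithm": "ridge"},
    {"lat": 40.75, "lng": -73.98, "demand": 30.0, "algorithm": "ridge"},
]
collection = predictions_to_geojson(sample_predictions)
geojson_text = json.dumps(collection)
```

A heat map layer can then use the `demand` property as the weight of each point, so areas with higher predicted demand appear hotter.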
Looking Back
How can the taxi industry utilize past data
to predict the future needs of their riders?
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website, Data, Machine Learning, Visualization
Looking Forward
Training Data: More Cities, Uber/Lyft
Models: Better Tuning, New Models
Prediction Modes: Wait Times, Prices
Our Mentor
James Ward
Engineering and Open Source Ambassador at
Salesforce.com
www.jamesward.com
Our Professor
Ross Tate
Assistant Professor in Computer Science at
Cornell University
http://www.cs.cornell.edu/~ross/
Contribute at
https://github.com/jamesward/koober
Questions?