Trulia Estimates 2.0

•

1 like•507 views

Praneet Mhatre

Education

Motivation
• Trulia Estimates launched in 2011
• Public records snowball has evolved since then, but the valuation
algorithm has not
• Valuations already have a lot of visibility (valuation heatmaps etc)
and we are planning to give them even more visibility in the near
future (valuations history)
• Brilliant Basics – Improve estimates before surfacing them
everywhere

Us v/s Competition
0 5 10 15
Trulia
Estimates
Zestimate
Median Error %
Trulia
Estimates
Zestimate

Our Work
• Location specific and temporal features
• Crime Safety
• School Proximity
• Stats and Trends
• New Geoscopes
• Solve the problem of geographic boundaries
• Model Learning Improvements
• Explicit modeling of location hierarchies
• Better learned parameters
• Better feature representation and normalization

New Features
8.97
8.78
8.82
8.84
8.65
8.7
8.75
8.8
8.85
8.9
8.95
9
Baseline Add
CrimeScore
only
Add
SchoolScore
only
Add avg
ppsqft/ hood
only
Improvement by Individual Features
Median Error
Percentage

New Geoscopes
 After the initial pass
 Coverage improved by 1.67% ~ 1.15million properties throughout the
nation
 330 more counties valued
 For San Mateo, median error goes from 8.97% to 8.85%

Model Learning Improvements
 Each geography is different. Static set of model parameters not
always ideal
 Using cross validation to learn parameters for each location model
from data
 Median error % improves from 8.97 to 8.69 (~3% relative improvement)
 Hierarchical Modeling
 Explicitly model Location Hierarchies to get smoother estimates using
higher level information

What’s Next?
 Spend more time optimizing new features – Optimization is
everything!
 Add price trends data to the hedonic model and simplify our learning
process
 Make per model parameter optimization scalable
 Incorporate hierarchical models into the existing mix

Similar to Trulia Estimates 2.0

Continuous Learning Systems: Building ML systems that learn from their mistakesAnuj Gupta

Day 1 1620 - 1705 - maple - pranabendu bhattacharyyaPMI2011

Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02PMI_IREP_TP

DataEngConf SF16 - Three lessons learned from building a production machine l...Hakka Labs

DefectmodelsinSparseenvironmentspbaxter

7 Reasons Why Value Stream Integration Improves Software Quality assuranceTasktop

Automatic Forecasting at ScaleSean Taylor

Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...Jonathon Hare

Horizon: Deep Reinforcement Learning at ScaleDatabricks

[DSC Europe 22] Starting deep learning projects without sufficient amount of ...DataScienceConferenc1

Value Stream Mapping – Stories From the TrenchesDevOps.com

Edwin Van Loon - How Much Testing is Enough - EuroSTAR 2010TEST Huddle

The Machine Learning Workflow with AzureIvo Andreev

Transport Modelling for managers 2014 willumsenLuis Willumsen

Kaggle Gold Medal Case StudyAlon Bochman, CFA

Quantitative Forecasting Techniques in SCMYountek1

SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...NAVER Engineering

Agile 2014- Metrics driven development and devopsKarthik Gaekwad

Metrics Driven Development and DevOps - Agile 2014Ernest Mueller

Anton Muzhailo - Practical Test Process Improvement using ISTQBIevgenii Katsan

Similar to Trulia Estimates 2.0 (20)

Continuous Learning Systems: Building ML systems that learn from their mistakes

Day 1 1620 - 1705 - maple - pranabendu bhattacharyya

Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02

DataEngConf SF16 - Three lessons learned from building a production machine l...

DefectmodelsinSparseenvironments

7 Reasons Why Value Stream Integration Improves Software Quality assurance

Automatic Forecasting at Scale

Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...

Horizon: Deep Reinforcement Learning at Scale

[DSC Europe 22] Starting deep learning projects without sufficient amount of ...

Value Stream Mapping – Stories From the Trenches

Edwin Van Loon - How Much Testing is Enough - EuroSTAR 2010

The Machine Learning Workflow with Azure

Transport Modelling for managers 2014 willumsen

Kaggle Gold Medal Case Study

Quantitative Forecasting Techniques in SCM

SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...

Agile 2014- Metrics driven development and devops

Metrics Driven Development and DevOps - Agile 2014

Anton Muzhailo - Practical Test Process Improvement using ISTQB

Recently uploaded

ESSENTIAL of (CS/IT/IS) class 07 (Networks)Dr. Mazin Mohamed alkathiri

Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...EADTU

An Overview of the Odoo 17 Knowledge AppCeline George

DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUMELOISARIVERA8

TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...Nguyen Thanh Tu Collection

AIM of Education-Teachers Training-2024.pptNishitharanjan Rout

Spellings Wk 4 and Wk 5 for Grade 4 at CAPSAnaAcapella

Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptxLimon Prince

PSYPACT- Practicing Over State Lines May 2024.pptxMarlene Maheu

e-Sealing at EADTU by Kamakshi RajagopalEADTU

OS-operating systems- ch05 (CPU Scheduling) ...Dr. Mazin Mohamed alkathiri

ANTI PARKISON DRUGS.pptxPoojaSen20

VAMOS CUIDAR DO NOSSO PLANETA! .Colégio Santa Teresinha

How to Manage Website in Odoo 17 Studio App.pptxCeline George

Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSean M. Fox

Mattingly "AI and Prompt Design: LLMs with NER"National Information Standards Organization (NISO)

How To Create Editable Tree View in Odoo 17Celine George

Trauma-Informed Leadership - Five Practical PrinciplesPooky Knightsmith

會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽中央社

8 Tips for Effective Working Capital ManagementMBA Assignment Experts

Recently uploaded (20)

ESSENTIAL of (CS/IT/IS) class 07 (Networks)

Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...

An Overview of the Odoo 17 Knowledge App

DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM

TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...

AIM of Education-Teachers Training-2024.ppt

Spellings Wk 4 and Wk 5 for Grade 4 at CAPS

Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx

PSYPACT- Practicing Over State Lines May 2024.pptx

e-Sealing at EADTU by Kamakshi Rajagopal

OS-operating systems- ch05 (CPU Scheduling) ...

ANTI PARKISON DRUGS.pptx

VAMOS CUIDAR DO NOSSO PLANETA! .

How to Manage Website in Odoo 17 Studio App.pptx

Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room

Mattingly "AI and Prompt Design: LLMs with NER"

How To Create Editable Tree View in Odoo 17

Trauma-Informed Leadership - Five Practical Principles

會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽

8 Tips for Effective Working Capital Management

Trulia Estimates 2.0

1. Trulia Estimates v2.0

3. Motivation • Trulia Estimates launched in 2011 • Public records snowball has evolved since then, but the valuation algorithm has not • Valuations already have a lot of visibility (valuation heatmaps etc) and we are planning to give them even more visibility in the near future (valuations history) • Brilliant Basics – Improve estimates before surfacing them everywhere

4. Us v/s Competition 0 5 10 15 Trulia Estimates Zestimate Median Error % Trulia Estimates Zestimate

5. Our Work • Location specific and temporal features • Crime Safety • School Proximity • Stats and Trends • New Geoscopes • Solve the problem of geographic boundaries • Model Learning Improvements • Explicit modeling of location hierarchies • Better learned parameters • Better feature representation and normalization

6. New Features 8.97 8.78 8.82 8.84 8.65 8.7 8.75 8.8 8.85 8.9 8.95 9 Baseline Add CrimeScore only Add SchoolScore only Add avg ppsqft/ hood only Improvement by Individual Features Median Error Percentage

7. New Geoscopes

8. New Geoscopes

9. New Geoscopes  After the initial pass  Coverage improved by 1.67% ~ 1.15million properties throughout the nation  330 more counties valued  For San Mateo, median error goes from 8.97% to 8.85%

10. Model Learning Improvements  Each geography is different. Static set of model parameters not always ideal  Using cross validation to learn parameters for each location model from data  Median error % improves from 8.97 to 8.69 (~3% relative improvement)  Hierarchical Modeling  Explicitly model Location Hierarchies to get smoother estimates using higher level information

11. What’s Next?  Spend more time optimizing new features – Optimization is everything!  Add price trends data to the hedonic model and simplify our learning process  Make per model parameter optimization scalable  Incorporate hierarchical models into the existing mix

Trulia Estimates 2.0

Recommended

Recommended

More Related Content

Similar to Trulia Estimates 2.0

Similar to Trulia Estimates 2.0 (20)

Recently uploaded

Recently uploaded (20)

Trulia Estimates 2.0