SlideShare a Scribd company logo
1 of 34
Download to read offline
1. Meetup 101
2. The data team @ Meetup
3. ML product considerations
Alex Charnas, Product Manager
Ben Schulte, Sr. Engineering Director
Zachary Cohn, Principal Engineer
1. Meetup 101
Our Mission
Empower personal growth through
real human connections
More real community
Groups are about (almost) everything!
How the platform works
Groups and Organizers
How the platform works
Groups and Organizers
How the platform works
Groups and Organizers
How the platform works
Groups and Organizers
Members
Join Groups
How the platform works
Members
Events In Real Life
Groups and Organizers
40 million Members
in 192 countries
300,000 Groups
3+ million RSVPs
every month
Meetup by the Numbers
300,000 Events
every month
2. Data team @ Meetup
Data and Machine Learning Mission
Data and analytics drives impact for the entire organization
● understand impact
● identify opportunities
● improve the customer experience
Machine learning directly improves the customer experience
● personalization -- batch & low latency
● insights at scale
How is the Data team organized @ Meetup?
Machine Learning (ML)
Build quality and relevance into Meetup with
customer products and reusable APIs
Data Science (DS)
Deep insights into Meetup activity and
experimentation for internal customers
Data Platform (DP)
The bedrock for low-latency, accurate data
that power DS, ML and analytics
How do the teams work together?
DP ∩ ML
● Implement & operate a machine learning platform to bring ML product to our members
● Empower other teams to use ML models & insights in their products
DS ∩ DP
● Collect, organize and enhance analytic data
● Provide trusted, performant & self-service access to Meetup data & insights
Machine Learning (ML)
Build quality and relevance into Meetup with
customer products and reusable APIs
● Connect members & organizers through
high-quality, highly relevant
recommendations
● Maintain a library of reusable attributes
describing our members, groups &
events
Data Science (DS)
Deep insights into Meetup activity and
experimentation for internal customers
● Establish, maintain and expand a set of
ground truths describing Meetup activity
● Maintain an experiment framework that is
trusted & used by PMs & engineers
Data Platform (DP)
The bedrock for low-latency, accurate data
that power DS, ML and analytics
● Ensure ongoing data fidelity,
low-latency data access and system
stability
● Provide tools for internal customers
to simplify data access and make
development at scale easy
DS ∩ ML
● Apply statistics at scale to describe &
predict meetup activity
Where does this fit in the organization?
3. Considerations for developing
ML product
1. Do you improve tools or the product?
Tools
● Decrease the cost ($$$)
● Reduce modeling / iteration
cycle time
● Add better data, feature,
model tracking
Product features
● New features
● New models
● Discovery / research
false choice!Correct answer…
Ideally* you improve the tools via product work:
Meetup ML product release New tooling added & now reused throughout platform
New Group Announcement Reusable feature library & distributed XGBoost training
Auto-approve Meetup Groups Low-latency features & auto-model retraining
Member → Group recommendation Airflow scheduling & lambda-served recommendations (burst
capacity!) on AWS
Show-up model Reduce model iteration time
Member → Topic recommendation Cloud compute $$$ pits of success
* $$$ / hours
2. Selecting an Objective Function
● How will success be measured?
● What should the machine try to learn?
Example: New Group Announcement
Example: New Group Announcement
Joins
But we care about lots of stuff
● Joins per email but also...
○ Are they RSVPing to the events later?
○ Are we seeing an increase in unsubscribes?
○ Do we see an increase in new group successful starts?
But we care about lots of stuff
● Joins per email but also...
○ Are they RSVPing to the events later?
○ Are we seeing an increase in unsubscribes?
○ Do we see an increase in new group successful starts?
● Could try to find one metric to rule them all
○ We prefer a straightforward and interpretable key indicator
○ Other metrics are balancing: look at only to identify problems
3. Making progress on projects crossing domains
Neighbor’s
fence
Neighbor’s yard
What do you do?
1)
2)
3)
Team
ownership
Another team’s
domain
ML Engineers
ML Product
What do you do?
1)
2)
3)
4. How to prioritize having data?
I often say that when you can measure what you are
speaking about, and express it in numbers, you
know something about it; but when you cannot
measure it, when you cannot express it in numbers,
your knowledge is of a meagre and unsatisfactory
kind; it may be the beginning of knowledge, but you
have scarcely, in your thoughts, advanced to the
stage of science, whatever the matter may be.
-- Lord Kelvin (and not a pithy Peter Drucker quote.)
Back to the Future
● Data is the lifeblood of machine learning.
Back to the Future
● Data is the lifeblood of machine learning.
● Observing the past is easier than predicting the future.
● Observing the past is hard!
Back to the Future
● Data is the lifeblood of machine learning.
● Observing the past is easier than predicting the future.
● Observing the past is hard!
● Training requires predicting the future, in the past.
○ That sounds easy -- it’s already in the past.
○ But you need a representation of the state
of the world at arbitrary points of history.
5. Translating Local Lift → Global Impact
Starting point: good (not great) impact from new ML model
How do we pump up the added value?
1. Follow the eyeballs → Know where impact is possible (not always easy)
2. Make some friends → what adjacent product could reuse your insight?
3. Socialize your ML portfolio
6. Owned vs. Supported vs. Arbitrated
1. Algorithms aren’t a neutral selection
mechanism -- while they can optimize
content in a “shared” channel (e.g.
what should we promote on our
homepage) these are rarely solely
data-driven decisions.
2. ML teams need a good way to iterate
independently -- offline analysis is
great, but the gold standard is A/B
testing in production. Without a way to
do that, improvements are slower.
Thank you!
meetup.com/jobs
meetup.com/register
medium.com/making-meetup
ben.schulte@meetup.com
zacharycohn@meetup.com
● Save your data
● Define an ML problem from a business need
● Build core capabilities for sustainable innovation
Build
Test
Iterate

More Related Content

What's hot

Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...
Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...
Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...Et Hish
 
Trends on Pinterest
Trends on PinterestTrends on Pinterest
Trends on PinterestJune Andrews
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellSri Ambati
 
The Next Generation of AI-Powered Search
The Next Generation of AI-Powered SearchThe Next Generation of AI-Powered Search
The Next Generation of AI-Powered SearchLucidworks
 
Design vs Data: Battle Royale (UX+Data Meetup)
Design vs Data: Battle Royale (UX+Data Meetup)Design vs Data: Battle Royale (UX+Data Meetup)
Design vs Data: Battle Royale (UX+Data Meetup)Jess Dale
 
You Mean You Don't Have to Start Over Every Time?
You Mean You Don't Have to Start Over Every Time?You Mean You Don't Have to Start Over Every Time?
You Mean You Don't Have to Start Over Every Time?Andrea L. Ames
 
Become a Data Analyst
Become a Data Analyst Become a Data Analyst
Become a Data Analyst Aaron Lamphere
 
How to think like a data scientist?
How to think like a data scientist?How to think like a data scientist?
How to think like a data scientist?Muyeena Khanzada
 
Dashboards Too Much Information
Dashboards Too Much InformationDashboards Too Much Information
Dashboards Too Much InformationSpectrum
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceLivePerson
 
Data sciences course
Data sciences courseData sciences course
Data sciences courseAJEETPANDEY50
 
Data sciences course pdf
Data sciences course pdfData sciences course pdf
Data sciences course pdfAjeetPandey51
 
Community-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision MakingCommunity-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision Makinggregoryg
 
To Estimate or Not To Estimate + #(No)Estimates Game
To Estimate or Not To Estimate + #(No)Estimates GameTo Estimate or Not To Estimate + #(No)Estimates Game
To Estimate or Not To Estimate + #(No)Estimates GameAgile Humans
 
Playing Nice in the Product Playground #StrataHadoop
Playing Nice in the Product Playground #StrataHadoopPlaying Nice in the Product Playground #StrataHadoop
Playing Nice in the Product Playground #StrataHadoopIntuit Inc.
 
Start Thinking Like a Data Scientist
Start Thinking Like a Data ScientistStart Thinking Like a Data Scientist
Start Thinking Like a Data ScientistAmanMehta47
 
Analytics Lessons Learnt
Analytics Lessons Learnt Analytics Lessons Learnt
Analytics Lessons Learnt Venkata Pingali
 
Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...June Andrews
 
Gsp hpsds l2_7.4_presenting data
Gsp hpsds l2_7.4_presenting dataGsp hpsds l2_7.4_presenting data
Gsp hpsds l2_7.4_presenting dataglobalstudypass
 

What's hot (20)

Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...
Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...
Analysis of "Data is Worthless if You Don’t Communicate It" by Thomas H. Dave...
 
Trends on Pinterest
Trends on PinterestTrends on Pinterest
Trends on Pinterest
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
 
The Next Generation of AI-Powered Search
The Next Generation of AI-Powered SearchThe Next Generation of AI-Powered Search
The Next Generation of AI-Powered Search
 
Design vs Data: Battle Royale (UX+Data Meetup)
Design vs Data: Battle Royale (UX+Data Meetup)Design vs Data: Battle Royale (UX+Data Meetup)
Design vs Data: Battle Royale (UX+Data Meetup)
 
You Mean You Don't Have to Start Over Every Time?
You Mean You Don't Have to Start Over Every Time?You Mean You Don't Have to Start Over Every Time?
You Mean You Don't Have to Start Over Every Time?
 
Data analysis and brain
Data analysis and brainData analysis and brain
Data analysis and brain
 
Become a Data Analyst
Become a Data Analyst Become a Data Analyst
Become a Data Analyst
 
How to think like a data scientist?
How to think like a data scientist?How to think like a data scientist?
How to think like a data scientist?
 
Dashboards Too Much Information
Dashboards Too Much InformationDashboards Too Much Information
Dashboards Too Much Information
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data sciences course
Data sciences courseData sciences course
Data sciences course
 
Data sciences course pdf
Data sciences course pdfData sciences course pdf
Data sciences course pdf
 
Community-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision MakingCommunity-Assisted Software Engineering Decision Making
Community-Assisted Software Engineering Decision Making
 
To Estimate or Not To Estimate + #(No)Estimates Game
To Estimate or Not To Estimate + #(No)Estimates GameTo Estimate or Not To Estimate + #(No)Estimates Game
To Estimate or Not To Estimate + #(No)Estimates Game
 
Playing Nice in the Product Playground #StrataHadoop
Playing Nice in the Product Playground #StrataHadoopPlaying Nice in the Product Playground #StrataHadoop
Playing Nice in the Product Playground #StrataHadoop
 
Start Thinking Like a Data Scientist
Start Thinking Like a Data ScientistStart Thinking Like a Data Scientist
Start Thinking Like a Data Scientist
 
Analytics Lessons Learnt
Analytics Lessons Learnt Analytics Lessons Learnt
Analytics Lessons Learnt
 
Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...
 
Gsp hpsds l2_7.4_presenting data
Gsp hpsds l2_7.4_presenting dataGsp hpsds l2_7.4_presenting data
Gsp hpsds l2_7.4_presenting data
 

Similar to Machine Learning Product Managers Meetup Event

Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analyticsRob Winters
 
What Are the Basics of Product Manager Interviews by Google PM
What Are the Basics of Product Manager Interviews by Google PMWhat Are the Basics of Product Manager Interviews by Google PM
What Are the Basics of Product Manager Interviews by Google PMProduct School
 
Start With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureStart With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureAggregage
 
Start With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureStart With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureBrittanyShear
 
DataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfDataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfJedha Bootcamp
 
Intro to Product Management by Trunk Club Product Manager
Intro to Product Management by Trunk Club Product ManagerIntro to Product Management by Trunk Club Product Manager
Intro to Product Management by Trunk Club Product ManagerProduct School
 
What are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMWhat are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMProduct School
 
Lean UX in the Enterprise: A Government Case Study
Lean UX in the Enterprise: A Government Case StudyLean UX in the Enterprise: A Government Case Study
Lean UX in the Enterprise: A Government Case Studyuxpin
 
Pin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 octPin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 octSteven Martin
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine LearningAgileThought
 
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...NadinaLisbon1
 
Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence The PNR
 
UX, DX, DSX: Developers and Data Scientists as Users
UX, DX, DSX: Developers and Data Scientists as UsersUX, DX, DSX: Developers and Data Scientists as Users
UX, DX, DSX: Developers and Data Scientists as UsersUXDXConf
 
MVP (Minimum Viable Product) Readiness | Boost Labs
MVP (Minimum Viable Product) Readiness | Boost LabsMVP (Minimum Viable Product) Readiness | Boost Labs
MVP (Minimum Viable Product) Readiness | Boost LabsBoost Labs
 
Data-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsData-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsEd Fry
 
Data-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsData-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsHull
 
Redesigning everything ITARC Stockholm 2021
Redesigning everything ITARC Stockholm 2021Redesigning everything ITARC Stockholm 2021
Redesigning everything ITARC Stockholm 2021Alberto Brandolini
 
data science and business analytics
data science and business analyticsdata science and business analytics
data science and business analyticssunnypatil1778
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teamsVenkatesh Umaashankar
 
Why And How to Transition into Product Management by Google PM
Why And How to Transition into Product Management by Google PMWhy And How to Transition into Product Management by Google PM
Why And How to Transition into Product Management by Google PMProduct School
 

Similar to Machine Learning Product Managers Meetup Event (20)

Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analytics
 
What Are the Basics of Product Manager Interviews by Google PM
What Are the Basics of Product Manager Interviews by Google PMWhat Are the Basics of Product Manager Interviews by Google PM
What Are the Basics of Product Manager Interviews by Google PM
 
Start With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureStart With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data Culture
 
Start With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data CultureStart With Why: Build Product Progress with a Strong Data Culture
Start With Why: Build Product Progress with a Strong Data Culture
 
DataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfDataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdf
 
Intro to Product Management by Trunk Club Product Manager
Intro to Product Management by Trunk Club Product ManagerIntro to Product Management by Trunk Club Product Manager
Intro to Product Management by Trunk Club Product Manager
 
What are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMWhat are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PM
 
Lean UX in the Enterprise: A Government Case Study
Lean UX in the Enterprise: A Government Case StudyLean UX in the Enterprise: A Government Case Study
Lean UX in the Enterprise: A Government Case Study
 
Pin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 octPin the tail on the metric v01 2016 oct
Pin the tail on the metric v01 2016 oct
 
Operationalizing Machine Learning
Operationalizing Machine LearningOperationalizing Machine Learning
Operationalizing Machine Learning
 
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
 
Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence Lunch and Learn Artificial intelligence
Lunch and Learn Artificial intelligence
 
UX, DX, DSX: Developers and Data Scientists as Users
UX, DX, DSX: Developers and Data Scientists as UsersUX, DX, DSX: Developers and Data Scientists as Users
UX, DX, DSX: Developers and Data Scientists as Users
 
MVP (Minimum Viable Product) Readiness | Boost Labs
MVP (Minimum Viable Product) Readiness | Boost LabsMVP (Minimum Viable Product) Readiness | Boost Labs
MVP (Minimum Viable Product) Readiness | Boost Labs
 
Data-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsData-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized Results
 
Data-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized ResultsData-Driven Growth: Lies, Lawyers & Outsized Results
Data-Driven Growth: Lies, Lawyers & Outsized Results
 
Redesigning everything ITARC Stockholm 2021
Redesigning everything ITARC Stockholm 2021Redesigning everything ITARC Stockholm 2021
Redesigning everything ITARC Stockholm 2021
 
data science and business analytics
data science and business analyticsdata science and business analytics
data science and business analytics
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Why And How to Transition into Product Management by Google PM
Why And How to Transition into Product Management by Google PMWhy And How to Transition into Product Management by Google PM
Why And How to Transition into Product Management by Google PM
 

Recently uploaded

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Recently uploaded (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

Machine Learning Product Managers Meetup Event

  • 1. 1. Meetup 101 2. The data team @ Meetup 3. ML product considerations Alex Charnas, Product Manager Ben Schulte, Sr. Engineering Director Zachary Cohn, Principal Engineer
  • 3. Our Mission Empower personal growth through real human connections More real community
  • 4. Groups are about (almost) everything!
  • 5. How the platform works Groups and Organizers
  • 6. How the platform works Groups and Organizers
  • 7. How the platform works Groups and Organizers
  • 8. How the platform works Groups and Organizers Members Join Groups
  • 9. How the platform works Members Events In Real Life Groups and Organizers
  • 10. 40 million Members in 192 countries 300,000 Groups 3+ million RSVPs every month Meetup by the Numbers 300,000 Events every month
  • 11. 2. Data team @ Meetup
  • 12. Data and Machine Learning Mission Data and analytics drives impact for the entire organization ● understand impact ● identify opportunities ● improve the customer experience Machine learning directly improves the customer experience ● personalization -- batch & low latency ● insights at scale
  • 13. How is the Data team organized @ Meetup? Machine Learning (ML) Build quality and relevance into Meetup with customer products and reusable APIs Data Science (DS) Deep insights into Meetup activity and experimentation for internal customers Data Platform (DP) The bedrock for low-latency, accurate data that power DS, ML and analytics
  • 14. How do the teams work together? DP ∩ ML ● Implement & operate a machine learning platform to bring ML product to our members ● Empower other teams to use ML models & insights in their products DS ∩ DP ● Collect, organize and enhance analytic data ● Provide trusted, performant & self-service access to Meetup data & insights Machine Learning (ML) Build quality and relevance into Meetup with customer products and reusable APIs ● Connect members & organizers through high-quality, highly relevant recommendations ● Maintain a library of reusable attributes describing our members, groups & events Data Science (DS) Deep insights into Meetup activity and experimentation for internal customers ● Establish, maintain and expand a set of ground truths describing Meetup activity ● Maintain an experiment framework that is trusted & used by PMs & engineers Data Platform (DP) The bedrock for low-latency, accurate data that power DS, ML and analytics ● Ensure ongoing data fidelity, low-latency data access and system stability ● Provide tools for internal customers to simplify data access and make development at scale easy DS ∩ ML ● Apply statistics at scale to describe & predict meetup activity
  • 15. Where does this fit in the organization?
  • 16. 3. Considerations for developing ML product
  • 17. 1. Do you improve tools or the product? Tools ● Decrease the cost ($$$) ● Reduce modeling / iteration cycle time ● Add better data, feature, model tracking Product features ● New features ● New models ● Discovery / research
  • 18. false choice!Correct answer… Ideally* you improve the tools via product work: Meetup ML product release New tooling added & now reused throughout platform New Group Announcement Reusable feature library & distributed XGBoost training Auto-approve Meetup Groups Low-latency features & auto-model retraining Member → Group recommendation Airflow scheduling & lambda-served recommendations (burst capacity!) on AWS Show-up model Reduce model iteration time Member → Topic recommendation Cloud compute $$$ pits of success * $$$ / hours
  • 19. 2. Selecting an Objective Function ● How will success be measured? ● What should the machine try to learn?
  • 20. Example: New Group Announcement
  • 21. Example: New Group Announcement Joins
  • 22. But we care about lots of stuff ● Joins per email but also... ○ Are they RSVPing to the events later? ○ Are we seeing an increase in unsubscribes? ○ Do we see an increase in new group successful starts?
  • 23. But we care about lots of stuff ● Joins per email but also... ○ Are they RSVPing to the events later? ○ Are we seeing an increase in unsubscribes? ○ Do we see an increase in new group successful starts? ● Could try to find one metric to rule them all ○ We prefer a straightforward and interpretable key indicator ○ Other metrics are balancing: look at only to identify problems
  • 24. 3. Making progress on projects crossing domains Neighbor’s fence Neighbor’s yard
  • 25. What do you do? 1) 2) 3)
  • 27. What do you do? 1) 2) 3)
  • 28. 4. How to prioritize having data? I often say that when you can measure what you are speaking about, and express it in numbers, you know something about it; but when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meagre and unsatisfactory kind; it may be the beginning of knowledge, but you have scarcely, in your thoughts, advanced to the stage of science, whatever the matter may be. -- Lord Kelvin (and not a pithy Peter Drucker quote.)
  • 29. Back to the Future ● Data is the lifeblood of machine learning.
  • 30. Back to the Future ● Data is the lifeblood of machine learning. ● Observing the past is easier than predicting the future. ● Observing the past is hard!
  • 31. Back to the Future ● Data is the lifeblood of machine learning. ● Observing the past is easier than predicting the future. ● Observing the past is hard! ● Training requires predicting the future, in the past. ○ That sounds easy -- it’s already in the past. ○ But you need a representation of the state of the world at arbitrary points of history.
  • 32. 5. Translating Local Lift → Global Impact Starting point: good (not great) impact from new ML model How do we pump up the added value? 1. Follow the eyeballs → Know where impact is possible (not always easy) 2. Make some friends → what adjacent product could reuse your insight? 3. Socialize your ML portfolio
  • 33. 6. Owned vs. Supported vs. Arbitrated 1. Algorithms aren’t a neutral selection mechanism -- while they can optimize content in a “shared” channel (e.g. what should we promote on our homepage) these are rarely solely data-driven decisions. 2. ML teams need a good way to iterate independently -- offline analysis is great, but the gold standard is A/B testing in production. Without a way to do that, improvements are slower.
  • 34. Thank you! meetup.com/jobs meetup.com/register medium.com/making-meetup ben.schulte@meetup.com zacharycohn@meetup.com ● Save your data ● Define an ML problem from a business need ● Build core capabilities for sustainable innovation Build Test Iterate