Sparking more meetups with machine learning

Sparking More Meetups with
Machine Learning (ML)
The Data Team @ Meetup

Our Mission
Bring people together in real life
(IRL) to create community and
opportunity for everyone.
Because we need a
more Meetuppy world

People Meetup about (almost)
everything!

The Meetup Model
Groups and Organizers

The Meetup Model
Groups and Organizers
Members
Join Groups

The Meetup Model
Members Groups and Organizers
Meetup IRL

38 million
Members worldwide
320 thousand
Meetup groups
192 countries
Meeting up
3+ million
RSVPs every month
Meetups By the Numbers
300 thousand
Meetups every month

How Machine Learning adds value
● Surface relevance
○ Members see relevant groups
○ New groups get shown to relevant members
○ Members see relevant events
○ Highlight quality events
● Speed up manual tasks
○ New group approval process

Announcing New Groups
● The key problem:
○ New groups don’t have members and need promotion
○ Active member base looking for new communities to join
○ Meetup takes on the responsibility of initial promotion => first
boost in membership
○ Promote via email
○ But don’t blast everybody
○ How do we find the relevant audience?

● Organizer => Name + Location + Description + Topics
● Human + ML => Email announcement
Group
Submitted
Policy
Review
Age /
Gender
Filters
Topics
Review
Group
Approval
Identify
Audience
Email

● Primary outreach -> Email
● ~1M emails / day
● ML identifies recipients

ML Infrastructure - Pre 2017
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email

Challenge: Scalability
User’s
Topics
User’s
Groups
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email

Challenge: Restricted Feature Complexity
User’s
Topics
User’s
Groups
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email

Challenge: Trained offline, non-scheduled
User’s
Topics
User’s
Groups
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Offline Model
Training

Challenge: Training & Deployment Code
Differences
User’s
Topics
User’s
Groups
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Offline Model
Training

Key Realization
Does this need to be an
online / real-time process?

Modernizing the ML Infrastructure
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features

Improvement: Richer Features
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features

Improvement: Horizontal scalability
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features

Improvement: Training + Deployment =
Same Code
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features

How it Performed
Evaluation:
● 50/50 split test of which model was used
to select who gets an email
Success measurement:
● Joins per Group
RESULT
30%+Joins per GroupOther Learnings:
● Processing time < 24 hours ⇒ 24-48 hours
● Additional delay => manage expectations
with organizers

Lessons Learned
● Keeping data in sync (rsync, sqoop, flume)
● Spark needs tuning
● Horizontal scaling not always an answer
● Airflow local testing
● Dealing with “random” failures
Needs Improvement
● Sampling
● Combining batch + online
● Sharing features
● Understanding model performance after deployment
● Predicting who’s going to “show up”

We’re Hiring
View Open Jobs @ Meetup
Or Email: Shayak

Sparking more meetups with machine learning

Recommended

Recommended

More Related Content

Similar to Sparking more meetups with machine learning

Similar to Sparking more meetups with machine learning (20)

Recently uploaded

Recently uploaded (20)

Sparking more meetups with machine learning