SlideShare a Scribd company logo
1 of 25
Download to read offline
Sparking More Meetups with
Machine Learning (ML)
The Data Team @ Meetup
Our Mission
Bring people together in real life
(IRL) to create community and
opportunity for everyone.
Because we need a
more Meetuppy world
People Meetup about (almost)
everything!
The Meetup Model
Groups and Organizers
The Meetup Model
Groups and Organizers
Members
Join Groups
The Meetup Model
Members Groups and Organizers
Meetup IRL
Global
38 million
Members worldwide
320 thousand
Meetup groups
192 countries
Meeting up
3+ million
RSVPs every month
Meetups By the Numbers
300 thousand
Meetups every month
How Machine Learning adds value
● Surface relevance
○ Members see relevant groups
○ New groups get shown to relevant members
○ Members see relevant events
○ Highlight quality events
● Speed up manual tasks
○ New group approval process
Announcing New Groups
● The key problem:
○ New groups don’t have members and need promotion
○ Active member base looking for new communities to join
○ Meetup takes on the responsibility of initial promotion => first
boost in membership
○ Promote via email
○ But don’t blast everybody
○ How do we find the relevant audience?
Announcing New Groups
● Organizer => Name + Location + Description + Topics
● Human + ML => Email announcement
Group
Submitted
Policy
Review
Age /
Gender
Filters
Topics
Review
Group
Approval
Identify
Audience
Email
Announcing New Groups
● Primary outreach -> Email
● ~1M emails / day
● ML identifies recipients
ML Infrastructure - Pre 2017
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Challenge: Scalability
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Challenge: Restricted Feature Complexity
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Challenge: Trained offline, non-scheduled
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Offline Model
Training
Challenge: Training & Deployment Code
Differences
User’s
Topics
User’s
Groups
Other in-memory maps
“Interest” Server
Ranking
Model
New Group Queue
Processing
Cron
Email Queue
API Call
Ranked List
of Users
New Group Email
Offline Model
Training
Key Realization
Does this need to be an
online / real-time process?
Modernizing the ML Infrastructure
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features
Improvement: Richer Features
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features
Improvement: Horizontal scalability
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features
Improvement: Training + Deployment =
Same Code
Data LakeNew Group
Queue
Processing
Cron
Worker 1
Worker 2
Yarn
Cluster
Cache Results
Distributed
Ranking
Model
Email Queue
Ranked List
of Users
New Group
Email
Features
How it Performed
Evaluation:
● 50/50 split test of which model was used
to select who gets an email
Success measurement:
● Joins per Group
RESULT
30%+Joins per GroupOther Learnings:
● Processing time < 24 hours ⇒ 24-48 hours
● Additional delay => manage expectations
with organizers
Lessons Learned
● Keeping data in sync (rsync, sqoop, flume)
● Spark needs tuning
● Horizontal scaling not always an answer
● Airflow local testing
● Dealing with “random” failures
Needs Improvement
● Sampling
● Combining batch + online
● Sharing features
● Understanding model performance after deployment
● Predicting who’s going to “show up”
We’re Hiring
View Open Jobs @ Meetup
Or Email: Shayak

More Related Content

Similar to Sparking more meetups with machine learning

Chapter congress Q&A 3.14.12
Chapter congress Q&A   3.14.12Chapter congress Q&A   3.14.12
Chapter congress Q&A 3.14.12
ynpnnational
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Safe Software
 
Enhancement and Enrichment of Digital Content by User Communities: The Austra...
Enhancement and Enrichment of Digital Content by User Communities: The Austra...Enhancement and Enrichment of Digital Content by User Communities: The Austra...
Enhancement and Enrichment of Digital Content by User Communities: The Austra...
Rose Holley
 

Similar to Sparking more meetups with machine learning (20)

Leverage Chapters to Engage & Grow Membership
Leverage Chapters to Engage & Grow MembershipLeverage Chapters to Engage & Grow Membership
Leverage Chapters to Engage & Grow Membership
 
Data-Driven Online Community Management
Data-Driven Online Community ManagementData-Driven Online Community Management
Data-Driven Online Community Management
 
Gamification review 1
Gamification review 1Gamification review 1
Gamification review 1
 
Chapter & National Databases - Do or Don't They Interface
Chapter & National Databases - Do or Don't They InterfaceChapter & National Databases - Do or Don't They Interface
Chapter & National Databases - Do or Don't They Interface
 
Using Dgroups in all its features
Using Dgroups in all its featuresUsing Dgroups in all its features
Using Dgroups in all its features
 
How to Read & Leverage Online Community Analytics
How to Read & Leverage Online Community AnalyticsHow to Read & Leverage Online Community Analytics
How to Read & Leverage Online Community Analytics
 
North American Collaboration Summit: Analyzing Teams engagement & adoption - ...
North American Collaboration Summit: Analyzing Teams engagement & adoption - ...North American Collaboration Summit: Analyzing Teams engagement & adoption - ...
North American Collaboration Summit: Analyzing Teams engagement & adoption - ...
 
If job boards died - where would you get your candidates?
If job boards died - where would you get your candidates?If job boards died - where would you get your candidates?
If job boards died - where would you get your candidates?
 
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn SearchStructure, Personalization, Scale: A Deep Dive into LinkedIn Search
Structure, Personalization, Scale: A Deep Dive into LinkedIn Search
 
Search Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital AssistantsSearch Tasks, Proactive Search & Digital Assistants
Search Tasks, Proactive Search & Digital Assistants
 
Chapter congress Q&A 3.14.12
Chapter congress Q&A   3.14.12Chapter congress Q&A   3.14.12
Chapter congress Q&A 3.14.12
 
Developing for Office365 groups
Developing for Office365 groupsDeveloping for Office365 groups
Developing for Office365 groups
 
Information Search
Information SearchInformation Search
Information Search
 
Arti Overview
Arti OverviewArti Overview
Arti Overview
 
Using LinkedIn for Job Search
Using LinkedIn for Job Search  Using LinkedIn for Job Search
Using LinkedIn for Job Search
 
Scrum Club System
Scrum Club SystemScrum Club System
Scrum Club System
 
Utilizing The Power of LinkedIn Groups
Utilizing The Power of LinkedIn GroupsUtilizing The Power of LinkedIn Groups
Utilizing The Power of LinkedIn Groups
 
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdfIntroducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
Introducing the New FME Community Webinar - Feb 21, 2024 (2).pdf
 
Enhancement and Enrichment of Digital Content by User Communities: The Aust...
Enhancement and Enrichment of Digital Content by User Communities: The Aust...Enhancement and Enrichment of Digital Content by User Communities: The Aust...
Enhancement and Enrichment of Digital Content by User Communities: The Aust...
 
Enhancement and Enrichment of Digital Content by User Communities: The Austra...
Enhancement and Enrichment of Digital Content by User Communities: The Austra...Enhancement and Enrichment of Digital Content by User Communities: The Austra...
Enhancement and Enrichment of Digital Content by User Communities: The Austra...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Sparking more meetups with machine learning

  • 1. Sparking More Meetups with Machine Learning (ML) The Data Team @ Meetup
  • 2. Our Mission Bring people together in real life (IRL) to create community and opportunity for everyone. Because we need a more Meetuppy world
  • 3. People Meetup about (almost) everything!
  • 4. The Meetup Model Groups and Organizers
  • 5. The Meetup Model Groups and Organizers Members Join Groups
  • 6. The Meetup Model Members Groups and Organizers Meetup IRL
  • 8. 38 million Members worldwide 320 thousand Meetup groups 192 countries Meeting up 3+ million RSVPs every month Meetups By the Numbers 300 thousand Meetups every month
  • 9. How Machine Learning adds value ● Surface relevance ○ Members see relevant groups ○ New groups get shown to relevant members ○ Members see relevant events ○ Highlight quality events ● Speed up manual tasks ○ New group approval process
  • 10. Announcing New Groups ● The key problem: ○ New groups don’t have members and need promotion ○ Active member base looking for new communities to join ○ Meetup takes on the responsibility of initial promotion => first boost in membership ○ Promote via email ○ But don’t blast everybody ○ How do we find the relevant audience?
  • 11. Announcing New Groups ● Organizer => Name + Location + Description + Topics ● Human + ML => Email announcement Group Submitted Policy Review Age / Gender Filters Topics Review Group Approval Identify Audience Email
  • 12. Announcing New Groups ● Primary outreach -> Email ● ~1M emails / day ● ML identifies recipients
  • 13. ML Infrastructure - Pre 2017 User’s Topics User’s Groups Other in-memory maps “Interest” Server Ranking Model New Group Queue Processing Cron Email Queue API Call Ranked List of Users New Group Email
  • 14. Challenge: Scalability User’s Topics User’s Groups Other in-memory maps “Interest” Server Ranking Model New Group Queue Processing Cron Email Queue API Call Ranked List of Users New Group Email
  • 15. Challenge: Restricted Feature Complexity User’s Topics User’s Groups Other in-memory maps “Interest” Server Ranking Model New Group Queue Processing Cron Email Queue API Call Ranked List of Users New Group Email
  • 16. Challenge: Trained offline, non-scheduled User’s Topics User’s Groups Other in-memory maps “Interest” Server Ranking Model New Group Queue Processing Cron Email Queue API Call Ranked List of Users New Group Email Offline Model Training
  • 17. Challenge: Training & Deployment Code Differences User’s Topics User’s Groups Other in-memory maps “Interest” Server Ranking Model New Group Queue Processing Cron Email Queue API Call Ranked List of Users New Group Email Offline Model Training
  • 18. Key Realization Does this need to be an online / real-time process?
  • 19. Modernizing the ML Infrastructure Data LakeNew Group Queue Processing Cron Worker 1 Worker 2 Yarn Cluster Cache Results Distributed Ranking Model Email Queue Ranked List of Users New Group Email Features
  • 20. Improvement: Richer Features Data LakeNew Group Queue Processing Cron Worker 1 Worker 2 Yarn Cluster Cache Results Distributed Ranking Model Email Queue Ranked List of Users New Group Email Features
  • 21. Improvement: Horizontal scalability Data LakeNew Group Queue Processing Cron Worker 1 Worker 2 Yarn Cluster Cache Results Distributed Ranking Model Email Queue Ranked List of Users New Group Email Features
  • 22. Improvement: Training + Deployment = Same Code Data LakeNew Group Queue Processing Cron Worker 1 Worker 2 Yarn Cluster Cache Results Distributed Ranking Model Email Queue Ranked List of Users New Group Email Features
  • 23. How it Performed Evaluation: ● 50/50 split test of which model was used to select who gets an email Success measurement: ● Joins per Group RESULT 30%+Joins per GroupOther Learnings: ● Processing time < 24 hours ⇒ 24-48 hours ● Additional delay => manage expectations with organizers
  • 24. Lessons Learned ● Keeping data in sync (rsync, sqoop, flume) ● Spark needs tuning ● Horizontal scaling not always an answer ● Airflow local testing ● Dealing with “random” failures Needs Improvement ● Sampling ● Combining batch + online ● Sharing features ● Understanding model performance after deployment ● Predicting who’s going to “show up”
  • 25. We’re Hiring View Open Jobs @ Meetup Or Email: Shayak