SlideShare a Scribd company logo
1 of 35
Download to read offline
HackYrNews:
design a personalized feed
for Hacker News
Andrew Kontaxis
Let’s browse HN
Page 4…
Let’s browse HN
Page 4…
Page 7…
Let’s browse HN
Page 4…
Page 7…
Page 8…
Let’s browse HN
Page 4…
Page 7…
Page 8…
Page 10…
Let’s browse HN
Filtering posts given
user interests
Filtering posts given
user interests
Capture how interests are grouped within
the community.
Filtering posts given
user interests
Capture how interests are grouped within
the community.
Exploration, flexible input.
DEMO
Under the hood…
Data: post titles, comment text for 30K posts.
Under the hood…
Data: post titles, comment text for 30K posts.
Model: word2vec model trained on comment text.
Under the hood…
Data: post titles, comment text for 30K posts.
Model: word2vec model trained on comment text.
python
amazon
app
web
vocabulary
apple
Word2Vec space W
web
python
amazon
julia
app
apple
julia
Recommendation, Step 1
Recommending posts
1. Map each user interest to a vector.
2. Map each post to a vector.
3. For each interest, recommend the “nearest”
30 posts.
Recommendation, Step 1
Recommending posts
1. Map each user interest to a vector.
2. Map each post to a vector.
3. For each interest, recommend the “nearest”
30 posts.
Word2Vec space W
Interests
apple,
+python|-web
Word2Vec space W
Interests
apple,
+python|-web
apple
Word2Vec space W
python
webInterests
apple,
+python|-web
apple
-web
Word2Vec space W
python
webInterests
apple,
+python|-web
apple
-web+python|-web
Recommendation, Step 1
Recommending posts
1. Map each user interest to a vector.
2. Map each post to a vector.
3. For each interest, recommend the “nearest”
30 posts.
Word2Vec space W
Post #7:
Amazon App store
Word2Vec space W
amazon
app
store
Post #7:
Amazon App store
Word2Vec space W
Post #7:
Amazon App store
amazon
app
store
Post #7
Recommendation, Step 1
Recommending posts
1. Map each user interest to a vector.
2. Map each post to a vector.
3. For each interest, recommend the “nearest”
30 posts.
Word2Vec space W
Interests
apple,
+python|-web apple
+python|-web
Post #7
Word2Vec space W
Interests
apple,
+python|-web apple
+python|-web
Post #7
Suggestions
Post #7:
“Amazon app store”
Recommendation, Step 1
Recommending posts
1. Map each user interest to a vector.
2. Map each post to a vector.
3. For each interest, recommend the “nearest”
30 posts.
About me
Interests:
+cooking|-baking,
+banjo|-bluegrass
Why this setup?
• Initial thought: some kind of topic modeling.
• User input is inflexible (topics are incoherent if we
use more than 10).
• Ideal use case is streaming, but number of topics
must be chosen manually, and “meanings” must
be assigned to topics manually.
• Similar remarks apply to clustering + dimension
reduction.

More Related Content

Similar to hackYrNews_talk

WordCamp Ireland - 40 tips for WordPress Optimization
WordCamp Ireland - 40 tips for WordPress OptimizationWordCamp Ireland - 40 tips for WordPress Optimization
WordCamp Ireland - 40 tips for WordPress OptimizationJoost de Valk
 
WordPress Optimisation Strategies
WordPress Optimisation StrategiesWordPress Optimisation Strategies
WordPress Optimisation Strategiesauexpo Conference
 
WordPress Optimisation - A4UExpo
WordPress Optimisation - A4UExpoWordPress Optimisation - A4UExpo
WordPress Optimisation - A4UExpoJoost de Valk
 
2016 Marketing Blog Post Template
2016 Marketing Blog Post Template2016 Marketing Blog Post Template
2016 Marketing Blog Post Templateunfunnel
 
J query aosddemo_2004
J query aosddemo_2004J query aosddemo_2004
J query aosddemo_2004aasishj
 
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Meta
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Metaapidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Meta
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Metaapidays
 
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)Thinkful
 
Secrets of Awesome JavaScript API Design
Secrets of Awesome JavaScript API DesignSecrets of Awesome JavaScript API Design
Secrets of Awesome JavaScript API DesignBrandon Satrom
 
Finding Missing Tweets using Topic Structure and Browsing Time
Finding Missing Tweets using Topic Structure and Browsing TimeFinding Missing Tweets using Topic Structure and Browsing Time
Finding Missing Tweets using Topic Structure and Browsing Timeysuzuki-naist
 
DataEngConf: Building the Next New York Times Recommendation Engine
DataEngConf: Building the Next New York Times Recommendation EngineDataEngConf: Building the Next New York Times Recommendation Engine
DataEngConf: Building the Next New York Times Recommendation EngineHakka Labs
 
How to Build an Accessible WordPress Theme
How to Build an Accessible WordPress ThemeHow to Build an Accessible WordPress Theme
How to Build an Accessible WordPress ThemeGraham Armfield
 
How to start a blog
How to start a blogHow to start a blog
How to start a blogPat Law
 
Socialite, the Open Source Status Feed Part 2: Managing the Social Graph
Socialite, the Open Source Status Feed Part 2: Managing the Social GraphSocialite, the Open Source Status Feed Part 2: Managing the Social Graph
Socialite, the Open Source Status Feed Part 2: Managing the Social GraphMongoDB
 
Guest author manual
Guest author manualGuest author manual
Guest author manualBerk Soysal
 
ExStreamlycheap Final Slides
ExStreamlycheap Final SlidesExStreamlycheap Final Slides
ExStreamlycheap Final SlidesEmmanuel Awa
 
Web 2.0 basics revised april 2010
Web 2.0 basics revised april 2010Web 2.0 basics revised april 2010
Web 2.0 basics revised april 2010guestff9dca
 

Similar to hackYrNews_talk (20)

Recsys 2016
Recsys 2016Recsys 2016
Recsys 2016
 
WordCamp Ireland - 40 tips for WordPress Optimization
WordCamp Ireland - 40 tips for WordPress OptimizationWordCamp Ireland - 40 tips for WordPress Optimization
WordCamp Ireland - 40 tips for WordPress Optimization
 
WordPress Optimisation Strategies
WordPress Optimisation StrategiesWordPress Optimisation Strategies
WordPress Optimisation Strategies
 
WordPress Optimisation - A4UExpo
WordPress Optimisation - A4UExpoWordPress Optimisation - A4UExpo
WordPress Optimisation - A4UExpo
 
2016 Marketing Blog Post Template
2016 Marketing Blog Post Template2016 Marketing Blog Post Template
2016 Marketing Blog Post Template
 
J query aosddemo_2004
J query aosddemo_2004J query aosddemo_2004
J query aosddemo_2004
 
Gamifying Wikimedia
Gamifying WikimediaGamifying Wikimedia
Gamifying Wikimedia
 
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Meta
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Metaapidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Meta
apidays LIVE Paris 2021 - Data Deletion at Facebook by Benoît Reitz, Meta
 
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)
Build a Web App with JavaScript and jQuery (5:18:17, Los Angeles)
 
Secrets of Awesome JavaScript API Design
Secrets of Awesome JavaScript API DesignSecrets of Awesome JavaScript API Design
Secrets of Awesome JavaScript API Design
 
Finding Missing Tweets using Topic Structure and Browsing Time
Finding Missing Tweets using Topic Structure and Browsing TimeFinding Missing Tweets using Topic Structure and Browsing Time
Finding Missing Tweets using Topic Structure and Browsing Time
 
DataEngConf: Building the Next New York Times Recommendation Engine
DataEngConf: Building the Next New York Times Recommendation EngineDataEngConf: Building the Next New York Times Recommendation Engine
DataEngConf: Building the Next New York Times Recommendation Engine
 
Publish Me a River
Publish Me a RiverPublish Me a River
Publish Me a River
 
How to Build an Accessible WordPress Theme
How to Build an Accessible WordPress ThemeHow to Build an Accessible WordPress Theme
How to Build an Accessible WordPress Theme
 
How to start a blog
How to start a blogHow to start a blog
How to start a blog
 
Socialite, the Open Source Status Feed Part 2: Managing the Social Graph
Socialite, the Open Source Status Feed Part 2: Managing the Social GraphSocialite, the Open Source Status Feed Part 2: Managing the Social Graph
Socialite, the Open Source Status Feed Part 2: Managing the Social Graph
 
Guest author manual
Guest author manualGuest author manual
Guest author manual
 
Effective Java
Effective JavaEffective Java
Effective Java
 
ExStreamlycheap Final Slides
ExStreamlycheap Final SlidesExStreamlycheap Final Slides
ExStreamlycheap Final Slides
 
Web 2.0 basics revised april 2010
Web 2.0 basics revised april 2010Web 2.0 basics revised april 2010
Web 2.0 basics revised april 2010
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

hackYrNews_talk