SlideShare a Scribd company logo
Facebook social
widget
For content discovery
Form of social pressure (2 friends like this)
Two of your close friends like this
- You are more likely to
discover the content
- You are more inclided to
believe/like the content
What if we can do the
same?
Create a blueprint for Future of Work in Factories. Test the
prototype in the customer production Factory, and obtain
first feedback of the prototype from end-users.
Similar Contributions:
Future of Onboarding Learning on Shopfloor
Alicia
Young
Bob
Stone
Joe
McNamara
Yves
Delacroix
Learning:
Learn to make
software Prototypes
Prototyping
Future of Work Factories of the Future
Experimental
Design
Text is analyzed in real time
- Extract Topic Meaning
- Keyword Extraction
Keywords are displayed
Key Influencers are found. You can:
- See their contribution
- Contact them directly
Related Marketplace Projects
Relevant Learning is found.
Add to MyLearning
Create a «Future of Work in Factories» Concept
Similar Contributions:
Future of Onboarding Learning on Shopfloor
Alicia
Young
Bob
Stone
Joe
McNamara
Yves
Delacroix
Future of Work Factories of the Future
Contact Alicia directly through the systemCheck Alicia´s contributions
- See if you shoud align
Align & Connect
Align with people working on the similar things
Find people from my network (that are in my communication
neighborhood)
Find key influencers company-wide
Circle Indicates people from my immediate
network
New Marketplace Project
Title
Description
Working on similar topics:
Bob
Stone
Joe
McNamara
Self-driving buses
The future of public transportation is in self-driving buses, that
are driving through metropolitan and rural areas.
The goal of this project is to create software for self-driving
buses, which would be the backbone of the public
transportation in metropolitan areas.
Self-driving Public Transport Buses
Interested in Contributing:
Yves
Delacroix
Text is analyzed in real time
- Extract Meaning from text
- Keyword Extraction
- Keyword translation to users language
ONA network finds key people to contact
- Based on keywords
- Based on Impact/Influence
Yves is interested in the topic
- Shown as suggested for the project
Invite Yves to contribute
Our Approach: Privacy and GDPR
Easier than feared!
Privacy-by-design (GDPR):
a) Article 4(5) defines pseudonymization as “the
processing of personal data in such a way that the
data can no longer be attributed to a specific data
subject without the use of additional information.”
b) For summary statistics or groups (non-PII) analytics:
Recital 26 defines anonymized data as “data
rendered anonymous in such a way that the data
subject is not or no longer identifiable.”
Cleanup PII
Utilize
PII re-bind
ONA/NLP/ML
Pseudonymize
Dataset
User
Consent
(PII Separation)
NOTE: PII - Personally identifiable information
7
What next?
8
1. Roll-out a GDPR-aware MVP
2. Learn from it and grow Data Product Managers
3. Apply ONA metrics & ML to pioneer new products
NLP and Keyword extraction
Input: Training email dataset
• Sanitization, header removal
• Filter outbound data
Processing:
• Simple NL preprocessing (lowercase, remove special
characters, tokenize)
• Named entity extraction and removal
• Stopwords removal
.
... continued
Processing:
• Lemmatization and part of speech filtering
• Identify bigrams and trigams
Now the data should be clean enough to do actual NLP on it.
.
LDA and RAKE
LDA – Latent Dirichlet Allocation:
• Topic modeling algorithm
• Extracts relevant topics from a corpus
• Unsupervised learning
• Needs number of topics beforehand
RAKE – Rapid Automatic Keyword Extraction:
• Statistical modeling of text
• Creates rank of keywords
• Easy to run, not actually ML
.
Word embeddings and TF/IDF
Word embeddings:
• Used fasttext embeddings – multilanguage
• Expand found keywords by similarity vectors
• Threshold value for similarity
TF/IDF:
• Ranking and relevance for documents
• Using any Lucene based solution
• Get free n-grams and typos
More Info?
Visit our Article Series:
bit.ly/2Rwhg2R
13
Vladimir Mijatovic | Alexandru Filip | Agron Fazliu
February, 2019
Freiburg

More Related Content

Similar to ONA ( organizational network analysis ) to enable individuals to impact their organization - part 2

Step 1 Consider the Purpose for the ResearchYou have been given.docx
Step 1 Consider the Purpose for the ResearchYou have been given.docxStep 1 Consider the Purpose for the ResearchYou have been given.docx
Step 1 Consider the Purpose for the ResearchYou have been given.docx
whitneyleman54422
 
2013_OSCON_Innovation_Presentation
2013_OSCON_Innovation_Presentation2013_OSCON_Innovation_Presentation
2013_OSCON_Innovation_Presentation
Laszlo Szalvay
 
222111Organization N.docx
222111Organization N.docx222111Organization N.docx
222111Organization N.docx
RAJU852744
 
Orlando SFDC User Group 10/2011
Orlando SFDC User Group 10/2011Orlando SFDC User Group 10/2011
Orlando SFDC User Group 10/2011
Joshua Hoskins
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinar
QuestionPro
 

Similar to ONA ( organizational network analysis ) to enable individuals to impact their organization - part 2 (20)

Website Personalization 101
Website Personalization 101Website Personalization 101
Website Personalization 101
 
BEA Pathways - Expertise Location
BEA Pathways - Expertise LocationBEA Pathways - Expertise Location
BEA Pathways - Expertise Location
 
lean + design thinking in building data products
lean + design thinking in building data productslean + design thinking in building data products
lean + design thinking in building data products
 
Enterprise 2.0: The new face of CRM
Enterprise 2.0: The new face of CRMEnterprise 2.0: The new face of CRM
Enterprise 2.0: The new face of CRM
 
Agile UX in Trustpilot
Agile UX in TrustpilotAgile UX in Trustpilot
Agile UX in Trustpilot
 
Startds9.19.17sd
Startds9.19.17sdStartds9.19.17sd
Startds9.19.17sd
 
What are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PMWhat are the Assumptions About Data Products by Hiya.com Lead PM
What are the Assumptions About Data Products by Hiya.com Lead PM
 
How to effectively implement different online research methods - UXPA 2015 - ...
How to effectively implement different online research methods - UXPA 2015 - ...How to effectively implement different online research methods - UXPA 2015 - ...
How to effectively implement different online research methods - UXPA 2015 - ...
 
How to Effectively Implement Different Online Research Techniques for Rapid U...
How to Effectively Implement Different Online Research Techniques for Rapid U...How to Effectively Implement Different Online Research Techniques for Rapid U...
How to Effectively Implement Different Online Research Techniques for Rapid U...
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
AI in linkedin
AI in linkedinAI in linkedin
AI in linkedin
 
How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...How an AI-backed recommendation system can help increase revenue for your onl...
How an AI-backed recommendation system can help increase revenue for your onl...
 
Step 1 Consider the Purpose for the ResearchYou have been given.docx
Step 1 Consider the Purpose for the ResearchYou have been given.docxStep 1 Consider the Purpose for the ResearchYou have been given.docx
Step 1 Consider the Purpose for the ResearchYou have been given.docx
 
Machine Learning for SEOs - SMXL
Machine Learning for SEOs - SMXLMachine Learning for SEOs - SMXL
Machine Learning for SEOs - SMXL
 
Feedback Management _ StroriesOnBoard.pdf
Feedback Management _ StroriesOnBoard.pdfFeedback Management _ StroriesOnBoard.pdf
Feedback Management _ StroriesOnBoard.pdf
 
2013_OSCON_Innovation_Presentation
2013_OSCON_Innovation_Presentation2013_OSCON_Innovation_Presentation
2013_OSCON_Innovation_Presentation
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )
 
222111Organization N.docx
222111Organization N.docx222111Organization N.docx
222111Organization N.docx
 
Orlando SFDC User Group 10/2011
Orlando SFDC User Group 10/2011Orlando SFDC User Group 10/2011
Orlando SFDC User Group 10/2011
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinar
 

More from Haufe-Lexware GmbH & Co KG

More from Haufe-Lexware GmbH & Co KG (20)

Tech stackhaufegroup
Tech stackhaufegroupTech stackhaufegroup
Tech stackhaufegroup
 
X-celerate 2019: Iterating fast with the MERN Stack
X-celerate 2019: Iterating fast with the MERN StackX-celerate 2019: Iterating fast with the MERN Stack
X-celerate 2019: Iterating fast with the MERN Stack
 
Nils Rhode - Does it always have to be k8s - TeC Day 2019
Nils Rhode - Does it always have to be k8s - TeC Day 2019Nils Rhode - Does it always have to be k8s - TeC Day 2019
Nils Rhode - Does it always have to be k8s - TeC Day 2019
 
Haufe Onboarding - Fast Iterating With the MERN Stack - TEC Day 2019
Haufe Onboarding - Fast Iterating With the MERN Stack - TEC Day 2019Haufe Onboarding - Fast Iterating With the MERN Stack - TEC Day 2019
Haufe Onboarding - Fast Iterating With the MERN Stack - TEC Day 2019
 
Cloud Journey: Lifting a Major Product to Kubernetes
Cloud Journey: Lifting a Major Product to KubernetesCloud Journey: Lifting a Major Product to Kubernetes
Cloud Journey: Lifting a Major Product to Kubernetes
 
ONA ( organizational network analysis ) enabling individuals to impact their ...
ONA ( organizational network analysis ) enabling individuals to impact their ...ONA ( organizational network analysis ) enabling individuals to impact their ...
ONA ( organizational network analysis ) enabling individuals to impact their ...
 
Using word vectors to enable better search in our legal products
Using word vectors to enable better search in our legal productsUsing word vectors to enable better search in our legal products
Using word vectors to enable better search in our legal products
 
Identifying customer potentials through unsupervised learning
Identifying customer potentials through unsupervised learningIdentifying customer potentials through unsupervised learning
Identifying customer potentials through unsupervised learning
 
Field report: Rapid application development
Field report: Rapid application developmentField report: Rapid application development
Field report: Rapid application development
 
Behavior-Driven Development with JGiven
Behavior-Driven Development with JGivenBehavior-Driven Development with JGiven
Behavior-Driven Development with JGiven
 
Externalized Spring Boot App Configuration
Externalized  Spring Boot App ConfigurationExternalized  Spring Boot App Configuration
Externalized Spring Boot App Configuration
 
Managing short lived Kubernetes (Production) deployments
Managing short lived Kubernetes (Production) deploymentsManaging short lived Kubernetes (Production) deployments
Managing short lived Kubernetes (Production) deployments
 
Docker in Production at the Aurora Team
Docker in Production at the Aurora TeamDocker in Production at the Aurora Team
Docker in Production at the Aurora Team
 
DevOps Journey of Foundational Services at Haufe
DevOps Journey of Foundational Services at HaufeDevOps Journey of Foundational Services at Haufe
DevOps Journey of Foundational Services at Haufe
 
New Serverless World - Cloud Native Apps
New Serverless World - Cloud Native AppsNew Serverless World - Cloud Native Apps
New Serverless World - Cloud Native Apps
 
Microservice Transformation of the Haufe Publishing System
Microservice Transformation of the Haufe Publishing SystemMicroservice Transformation of the Haufe Publishing System
Microservice Transformation of the Haufe Publishing System
 
Haufe API Strategy
Haufe API StrategyHaufe API Strategy
Haufe API Strategy
 
Haufe's Tech Strategy In Practice
Haufe's Tech Strategy In PracticeHaufe's Tech Strategy In Practice
Haufe's Tech Strategy In Practice
 
Kubernetes Intro @HaufeDev
Kubernetes Intro @HaufeDev Kubernetes Intro @HaufeDev
Kubernetes Intro @HaufeDev
 
API Management with wicked.haufe.io
API Management with wicked.haufe.ioAPI Management with wicked.haufe.io
API Management with wicked.haufe.io
 

Recently uploaded

一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 

Recently uploaded (20)

Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 

ONA ( organizational network analysis ) to enable individuals to impact their organization - part 2

  • 1. Facebook social widget For content discovery Form of social pressure (2 friends like this) Two of your close friends like this - You are more likely to discover the content - You are more inclided to believe/like the content
  • 2. What if we can do the same?
  • 3.
  • 4. Create a blueprint for Future of Work in Factories. Test the prototype in the customer production Factory, and obtain first feedback of the prototype from end-users. Similar Contributions: Future of Onboarding Learning on Shopfloor Alicia Young Bob Stone Joe McNamara Yves Delacroix Learning: Learn to make software Prototypes Prototyping Future of Work Factories of the Future Experimental Design Text is analyzed in real time - Extract Topic Meaning - Keyword Extraction Keywords are displayed Key Influencers are found. You can: - See their contribution - Contact them directly Related Marketplace Projects Relevant Learning is found. Add to MyLearning Create a «Future of Work in Factories» Concept
  • 5. Similar Contributions: Future of Onboarding Learning on Shopfloor Alicia Young Bob Stone Joe McNamara Yves Delacroix Future of Work Factories of the Future Contact Alicia directly through the systemCheck Alicia´s contributions - See if you shoud align Align & Connect Align with people working on the similar things Find people from my network (that are in my communication neighborhood) Find key influencers company-wide Circle Indicates people from my immediate network
  • 6. New Marketplace Project Title Description Working on similar topics: Bob Stone Joe McNamara Self-driving buses The future of public transportation is in self-driving buses, that are driving through metropolitan and rural areas. The goal of this project is to create software for self-driving buses, which would be the backbone of the public transportation in metropolitan areas. Self-driving Public Transport Buses Interested in Contributing: Yves Delacroix Text is analyzed in real time - Extract Meaning from text - Keyword Extraction - Keyword translation to users language ONA network finds key people to contact - Based on keywords - Based on Impact/Influence Yves is interested in the topic - Shown as suggested for the project Invite Yves to contribute
  • 7. Our Approach: Privacy and GDPR Easier than feared! Privacy-by-design (GDPR): a) Article 4(5) defines pseudonymization as “the processing of personal data in such a way that the data can no longer be attributed to a specific data subject without the use of additional information.” b) For summary statistics or groups (non-PII) analytics: Recital 26 defines anonymized data as “data rendered anonymous in such a way that the data subject is not or no longer identifiable.” Cleanup PII Utilize PII re-bind ONA/NLP/ML Pseudonymize Dataset User Consent (PII Separation) NOTE: PII - Personally identifiable information 7
  • 8. What next? 8 1. Roll-out a GDPR-aware MVP 2. Learn from it and grow Data Product Managers 3. Apply ONA metrics & ML to pioneer new products
  • 9. NLP and Keyword extraction Input: Training email dataset • Sanitization, header removal • Filter outbound data Processing: • Simple NL preprocessing (lowercase, remove special characters, tokenize) • Named entity extraction and removal • Stopwords removal
  • 10. . ... continued Processing: • Lemmatization and part of speech filtering • Identify bigrams and trigams Now the data should be clean enough to do actual NLP on it.
  • 11. . LDA and RAKE LDA – Latent Dirichlet Allocation: • Topic modeling algorithm • Extracts relevant topics from a corpus • Unsupervised learning • Needs number of topics beforehand RAKE – Rapid Automatic Keyword Extraction: • Statistical modeling of text • Creates rank of keywords • Easy to run, not actually ML
  • 12. . Word embeddings and TF/IDF Word embeddings: • Used fasttext embeddings – multilanguage • Expand found keywords by similarity vectors • Threshold value for similarity TF/IDF: • Ranking and relevance for documents • Using any Lucene based solution • Get free n-grams and typos
  • 13. More Info? Visit our Article Series: bit.ly/2Rwhg2R 13 Vladimir Mijatovic | Alexandru Filip | Agron Fazliu February, 2019 Freiburg