SlideShare a Scribd company logo
1 of 1
PROBLEM & APPROACH
Research Questions:
• What are recent topics in machine learning
research?
• Is there any interesting relationship among these
topics?
• Is it possible to find research document based on
relevancy instead of simple keywords?
Approaches:
• Analyze machine learning publications from major
research journals
• Identify underlying topics of these document using
topic modelling
• Identify interesting relationship among topics
using network analysis
• Compute relevance score of document based on
topic distribution similarity
DATA ANALYSIS PIPELINE TOPIC MODEL
Model Selection:
Topics:
Simon Fraser University
Machine Learning Research Exploration
Jacob Su, Liam Bui
SUMMARY
• Major recent topics and relationships might be different from our
impression
• The proposed topic distribution similarity-based approach can provide
relevant search results and help auto-tagging.
Learning:
NETWORK ANALYSIS
MongoDB:
- Title
- Authors
- Abstract
- Content
- URL
Data Source:
3-year publications
7000 document
4GBs data
Exploratory Analysis:
Word Cloud & Frequency
Topic Model:
Latent Dirichlet Allocation
Network Analysis:
Co-occurrence
Independence
Data Product:
Document Search based on
topic distribution similarity
DATA PRODUCT
DATA EXPLORATION:
DATA COLLECTION:
Web Scraping
PDF Processing
Data Pre-Processing:
Word Lemmatization
Stop Word Removal
Feature Selection:
Abstract vs Full Text
TF vs TF-IDF
Browse
Search
Tag
Node Size: Eigen Centrality Edge: X2 significant association
Choose k = 45
“Full Text” Word Cloud “Abstract” Word Cloud
Methodology: Data Visualization – Model Interpretation – Project Planning
Technology: Scrapy, wordcloud, Gephi, LDA, GraphX, MongoDB, Django
Evaluation:
- Perplexity
- Word Intrusion

More Related Content

What's hot

Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Shahriar Rafee
 
SOC2002 Lecture 12
SOC2002 Lecture 12SOC2002 Lecture 12
SOC2002 Lecture 12Bonnie Green
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkAravind Sesagiri Raamkumar
 
Usability Evaluation of a Research Repository and Collaboration Website For H...
Usability Evaluation of a Research Repository and Collaboration Website For H...Usability Evaluation of a Research Repository and Collaboration Website For H...
Usability Evaluation of a Research Repository and Collaboration Website For H...Tao Zhang
 
Early Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsEarly Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsAngelo Salatino
 
What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...Aravind Sesagiri Raamkumar
 
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksDetection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksAngelo Salatino
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboardpetrknoth
 
V.3 poster current citations and a future with linked data
V.3 poster current citations and a future with linked dataV.3 poster current citations and a future with linked data
V.3 poster current citations and a future with linked dataIliadis Dimitrios
 
More Than Just Black and White: A Case for Grey Literature References in Scie...
More Than Just Black and White: A Case for Grey Literature References in Scie...More Than Just Black and White: A Case for Grey Literature References in Scie...
More Than Just Black and White: A Case for Grey Literature References in Scie...Aravind Sesagiri Raamkumar
 
Mining Virtual Reference Data for an Iterative Assessment Cycle
Mining Virtual Reference Data for an Iterative Assessment CycleMining Virtual Reference Data for an Iterative Assessment Cycle
Mining Virtual Reference Data for an Iterative Assessment CycleAmanda Clay Powers
 
ICPSR Data Services at Kenyon College - David Thomas
ICPSR Data Services at Kenyon College - David ThomasICPSR Data Services at Kenyon College - David Thomas
ICPSR Data Services at Kenyon College - David Thomasjoemmurphy
 
Bl labs ucl-services
Bl labs ucl-servicesBl labs ucl-services
Bl labs ucl-servicesbenosteen
 
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...Eliezer Silva
 
Owl presentation
Owl presentationOwl presentation
Owl presentationBobby Sr.
 

What's hot (20)

Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2
 
SOC2002 Lecture 12
SOC2002 Lecture 12SOC2002 Lecture 12
SOC2002 Lecture 12
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
 
Usability Evaluation of a Research Repository and Collaboration Website For H...
Usability Evaluation of a Research Repository and Collaboration Website For H...Usability Evaluation of a Research Repository and Collaboration Website For H...
Usability Evaluation of a Research Repository and Collaboration Website For H...
 
Early Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsEarly Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research Trends
 
SOC2002 Lecture 2
SOC2002 Lecture 2SOC2002 Lecture 2
SOC2002 Lecture 2
 
What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...
 
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksDetection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
 
data citation
data citationdata citation
data citation
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboard
 
V.3 poster current citations and a future with linked data
V.3 poster current citations and a future with linked dataV.3 poster current citations and a future with linked data
V.3 poster current citations and a future with linked data
 
More Than Just Black and White: A Case for Grey Literature References in Scie...
More Than Just Black and White: A Case for Grey Literature References in Scie...More Than Just Black and White: A Case for Grey Literature References in Scie...
More Than Just Black and White: A Case for Grey Literature References in Scie...
 
Mining Virtual Reference Data for an Iterative Assessment Cycle
Mining Virtual Reference Data for an Iterative Assessment CycleMining Virtual Reference Data for an Iterative Assessment Cycle
Mining Virtual Reference Data for an Iterative Assessment Cycle
 
ICPSR Data Services at Kenyon College - David Thomas
ICPSR Data Services at Kenyon College - David ThomasICPSR Data Services at Kenyon College - David Thomas
ICPSR Data Services at Kenyon College - David Thomas
 
SOC2002 Lecture 1
SOC2002 Lecture 1SOC2002 Lecture 1
SOC2002 Lecture 1
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Bl labs ucl-services
Bl labs ucl-servicesBl labs ucl-services
Bl labs ucl-services
 
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...
Content-Based Social Recommendation with Poisson Matrix Factorization (ECML-P...
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Owl presentation
Owl presentationOwl presentation
Owl presentation
 

Similar to Topic model

Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Vrije Universiteit Amsterdam
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...AKSHAY BHAGAT
 
Fancy resumeziweimeng0502
Fancy resumeziweimeng0502Fancy resumeziweimeng0502
Fancy resumeziweimeng0502Ziwei(Zoe) Meng
 
Ux Research Portfolio
Ux Research PortfolioUx Research Portfolio
Ux Research PortfolioTao Zhang
 
Literature review in social sciences
Literature review in social sciencesLiterature review in social sciences
Literature review in social sciencesDr. Chinchu C
 
Finding research evidence 2016
Finding research evidence 2016 Finding research evidence 2016
Finding research evidence 2016 John Iona
 
empirical-SLR.pptx
empirical-SLR.pptxempirical-SLR.pptx
empirical-SLR.pptxJitha Kannan
 
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...Pam Marino
 
User centred design and students' library search behaviours
User centred design and students' library search behavioursUser centred design and students' library search behaviours
User centred design and students' library search behavioursVernon Fowler
 
Learning Emergent Knowledge from Blog Postings
Learning Emergent Knowledge from Blog PostingsLearning Emergent Knowledge from Blog Postings
Learning Emergent Knowledge from Blog PostingsSaltlux Inc.
 
Linked data for Libraries
Linked data for LibrariesLinked data for Libraries
Linked data for Librariesrobin fay
 
PSY4035 finding research info 2017
PSY4035 finding research info 2017 PSY4035 finding research info 2017
PSY4035 finding research info 2017 John Iona
 
Visualizing communication at scad school of design
Visualizing communication at scad school of designVisualizing communication at scad school of design
Visualizing communication at scad school of designSAAD ALZAROONI, CM
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniquesveningstonk
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Research Writing - Universitas Indonesia
Research Writing - Universitas IndonesiaResearch Writing - Universitas Indonesia
Research Writing - Universitas IndonesiaFariz Darari
 
Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)danw421
 

Similar to Topic model (20)

Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
 
Websci 2018
Websci 2018Websci 2018
Websci 2018
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
Fancy resumeziweimeng0502
Fancy resumeziweimeng0502Fancy resumeziweimeng0502
Fancy resumeziweimeng0502
 
Ux Research Portfolio
Ux Research PortfolioUx Research Portfolio
Ux Research Portfolio
 
Literature review in social sciences
Literature review in social sciencesLiterature review in social sciences
Literature review in social sciences
 
Finding research evidence 2016
Finding research evidence 2016 Finding research evidence 2016
Finding research evidence 2016
 
Blog clustering
Blog clusteringBlog clustering
Blog clustering
 
empirical-SLR.pptx
empirical-SLR.pptxempirical-SLR.pptx
empirical-SLR.pptx
 
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...
S^4-Super Summer Science Search™ Contest Study Session #3: Selecting Your Top...
 
User centred design and students' library search behaviours
User centred design and students' library search behavioursUser centred design and students' library search behaviours
User centred design and students' library search behaviours
 
Learning Emergent Knowledge from Blog Postings
Learning Emergent Knowledge from Blog PostingsLearning Emergent Knowledge from Blog Postings
Learning Emergent Knowledge from Blog Postings
 
Linked data for Libraries
Linked data for LibrariesLinked data for Libraries
Linked data for Libraries
 
Final presentation
Final presentationFinal presentation
Final presentation
 
PSY4035 finding research info 2017
PSY4035 finding research info 2017 PSY4035 finding research info 2017
PSY4035 finding research info 2017
 
Visualizing communication at scad school of design
Visualizing communication at scad school of designVisualizing communication at scad school of design
Visualizing communication at scad school of design
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniques
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Research Writing - Universitas Indonesia
Research Writing - Universitas IndonesiaResearch Writing - Universitas Indonesia
Research Writing - Universitas Indonesia
 
Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)Student research behavior — prototype application (at CIL)
Student research behavior — prototype application (at CIL)
 

Recently uploaded

INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 

Recently uploaded (20)

INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 

Topic model

  • 1. PROBLEM & APPROACH Research Questions: • What are recent topics in machine learning research? • Is there any interesting relationship among these topics? • Is it possible to find research document based on relevancy instead of simple keywords? Approaches: • Analyze machine learning publications from major research journals • Identify underlying topics of these document using topic modelling • Identify interesting relationship among topics using network analysis • Compute relevance score of document based on topic distribution similarity DATA ANALYSIS PIPELINE TOPIC MODEL Model Selection: Topics: Simon Fraser University Machine Learning Research Exploration Jacob Su, Liam Bui SUMMARY • Major recent topics and relationships might be different from our impression • The proposed topic distribution similarity-based approach can provide relevant search results and help auto-tagging. Learning: NETWORK ANALYSIS MongoDB: - Title - Authors - Abstract - Content - URL Data Source: 3-year publications 7000 document 4GBs data Exploratory Analysis: Word Cloud & Frequency Topic Model: Latent Dirichlet Allocation Network Analysis: Co-occurrence Independence Data Product: Document Search based on topic distribution similarity DATA PRODUCT DATA EXPLORATION: DATA COLLECTION: Web Scraping PDF Processing Data Pre-Processing: Word Lemmatization Stop Word Removal Feature Selection: Abstract vs Full Text TF vs TF-IDF Browse Search Tag Node Size: Eigen Centrality Edge: X2 significant association Choose k = 45 “Full Text” Word Cloud “Abstract” Word Cloud Methodology: Data Visualization – Model Interpretation – Project Planning Technology: Scrapy, wordcloud, Gephi, LDA, GraphX, MongoDB, Django Evaluation: - Perplexity - Word Intrusion

Editor's Notes

  1. JMLR, NIPS,IEEE, ARXV, SpringerLink