SlideShare a Scribd company logo
Recommending Scientific Papers:
Investigating the User Curriculum
Jonathas Magalhães, Cleyton Souza,
Evandro Costa, Joseana Fechine
Warm up!
• How does research in Brazil work?
• What is Lattes?
• Why is Lattes a big deal in Brazil?
Lattes = Opportunity
• The information available in Lattes creates a
great opportunity to recommend science
related content to researchers in Brazil…
– Projects
– Contributors
– Call for Papers
– Papers
• … and to test different algorithtms!
Our Work
• In this paper:
– (1) We present a Personalized Paper
Recommender System
• A user-paper approach that takes into consideration
the Lattes information.
– (2) We test different profiling strategies
– (3) We test how much older information is
necessary in order to provide better
recommendations.
– (4) We compare our strategy with state of art
Recommendation Algorithm
• 𝑠𝑖𝑚 𝑢𝑠𝑒𝑟 𝑝𝑟𝑜𝑓𝑖𝑙𝑒, 𝑝𝑎𝑝𝑒𝑟
– Cosine similarity
• Age weighting
1 −
Δ𝑦
Δ𝑣
Profiling Strategies
• Concepts Profile
–The vector is composed of predefined
concepts.
• Terms Profile
–The vector is composed of the set of
terms that compose the dictionary.
Research Questions
• 𝑄1: How many years of the user curriculum
are necessary to use in order to provide great
recommendations?
• 𝑄2: Is there any difference between the
concepts profile and terms profile?
• 𝑄3: Is Lopes’s algorithm better than them?
• 𝑄4: Which method should we choose?
Evaluation
• To answer our research questions, we
conducted a user study experiment
• We developed a system to collect user’s
impression about a set of papers
Evaluation
• We used user’s impressions to ranking the
papers and compared with the outcome of
the recommendation algorithms to answer
our research questions
Results
The NDCG@5, NDCG@10 and length means of the methods of generated
profiles. We execute Shapiro-Wilk test to verify the data normality. The
symbol (*) indicates that the data is not normally distributed, i.e.,
𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.05
Results
Results of hypothesis testes performed to compare the strategis. Both tests are
performed with parameters 𝛼 = 0.05, alternative = “greater”, paired=TRUE
(>> and > denote significance levels pof 𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.01 and 𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.05,
respectively).
𝑄1: How many years of the user curriculum are
necessary to use in order to provide great
recommendations?
Answer: It depends of the profiling strategy: four
years for TP and five years for CP, apparently.
𝑄2: Is there any difference between the
concepts profile and terms profile?
Answer: Comparing the terms profile TP4 with
concepts profile CP5, we verify a not statically
proved superiority. Thus, there is no difference.
𝑄3: Is Lopes’s algorithm better than them?
Answer: Yes, both approaches (TP4 and CP5)
achieved statiscaly better performance than
Lopes.
𝑄4: Which method should we choose?
Answer: It depends on the context, because there is a
trade-off between the techniques. If the system needs an
online recommendation with reasonable quality, the CP
profiles are the best choice. On the other hand, if the
systems can compute the recommendations offline, and
the time consuming is not a problem, the T P is better.
Conclusion and Future Work
• We presented and evaluated our approach to a
paper Recommender System that considers the
user curriculum crawled from the CV-Lattes
• Our main contributions are:
– Our algorithms achieved better performance than
state-of-art paper recommendation algorithm dealing
with Lattes
– We observed no statistical difference between both
profiling strategies.
– We build a dataset that can be used for future
research in the are
Conclusion and Future Work
• Our planning:
– To confront our results with data from others CV-
oriented networks
– To work in a integration algorithm to combine
data from multiple sources
– To improve the recommendation model using
paper related information

More Related Content

What's hot

MELJUN CORTES research seminar_1_formulating_a_research_problem
MELJUN CORTES research seminar_1_formulating_a_research_problemMELJUN CORTES research seminar_1_formulating_a_research_problem
MELJUN CORTES research seminar_1_formulating_a_research_problemMELJUN CORTES
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...GESIS
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Alan Said
 
Data Collection and Analysis Tools
Data Collection and Analysis ToolsData Collection and Analysis Tools
Data Collection and Analysis ToolsCaren Gamboa
 
Adam Etkin's Flash Presentation from STM Spring 2014
Adam Etkin's Flash Presentation from STM Spring 2014Adam Etkin's Flash Presentation from STM Spring 2014
Adam Etkin's Flash Presentation from STM Spring 2014Adam Etkin
 
Data Analysis for All Students
Data Analysis for All StudentsData Analysis for All Students
Data Analysis for All StudentsL H
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...alessio_ferrari
 
Intro mathematical studies ia
Intro   mathematical studies iaIntro   mathematical studies ia
Intro mathematical studies iapmakunja
 
Learning analytics and accessibility – #calrg 2015
Learning analytics and accessibility – #calrg 2015Learning analytics and accessibility – #calrg 2015
Learning analytics and accessibility – #calrg 2015Martyn Cooper
 
Data Analysis for Subgroups of Students
Data Analysis for Subgroups of StudentsData Analysis for Subgroups of Students
Data Analysis for Subgroups of StudentsL H
 
Dispersion
DispersionDispersion
DispersionL H
 
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...Stephen Childs
 
Data Driven College Counseling by SchooLinks
Data Driven College Counseling by SchooLinksData Driven College Counseling by SchooLinks
Data Driven College Counseling by SchooLinksKatie Fang
 
Trends over time
Trends over timeTrends over time
Trends over timedjleach
 
Multiple Response Questions - Allowing for chance in authentic assessments
Multiple Response Questions - Allowing for chance in authentic assessmentsMultiple Response Questions - Allowing for chance in authentic assessments
Multiple Response Questions - Allowing for chance in authentic assessmentsMhairi Mcalpine
 
Data Management Lab: Data mapping exercise example
Data Management Lab: Data mapping exercise exampleData Management Lab: Data mapping exercise example
Data Management Lab: Data mapping exercise exampleIUPUI
 
intro to quantitative
intro to quantitativeintro to quantitative
intro to quantitativees kpdl
 
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Platforma Otwartej Nauki
 

What's hot (20)

MELJUN CORTES research seminar_1_formulating_a_research_problem
MELJUN CORTES research seminar_1_formulating_a_research_problemMELJUN CORTES research seminar_1_formulating_a_research_problem
MELJUN CORTES research seminar_1_formulating_a_research_problem
 
Regenstrief WIP 07012015
Regenstrief WIP 07012015Regenstrief WIP 07012015
Regenstrief WIP 07012015
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
 
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
Comparative Recommender System Evaluation: Benchmarking Recommendation Frame...
 
Data Collection and Analysis Tools
Data Collection and Analysis ToolsData Collection and Analysis Tools
Data Collection and Analysis Tools
 
Adam Etkin's Flash Presentation from STM Spring 2014
Adam Etkin's Flash Presentation from STM Spring 2014Adam Etkin's Flash Presentation from STM Spring 2014
Adam Etkin's Flash Presentation from STM Spring 2014
 
Data Analysis for All Students
Data Analysis for All StudentsData Analysis for All Students
Data Analysis for All Students
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
 
Intro mathematical studies ia
Intro   mathematical studies iaIntro   mathematical studies ia
Intro mathematical studies ia
 
Learning analytics and accessibility – #calrg 2015
Learning analytics and accessibility – #calrg 2015Learning analytics and accessibility – #calrg 2015
Learning analytics and accessibility – #calrg 2015
 
Data Analysis for Subgroups of Students
Data Analysis for Subgroups of StudentsData Analysis for Subgroups of Students
Data Analysis for Subgroups of Students
 
Rubrics
RubricsRubrics
Rubrics
 
Dispersion
DispersionDispersion
Dispersion
 
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...
CIRPA 2016: Individual Level Predictive Analytics for Improving Student Enrol...
 
Data Driven College Counseling by SchooLinks
Data Driven College Counseling by SchooLinksData Driven College Counseling by SchooLinks
Data Driven College Counseling by SchooLinks
 
Trends over time
Trends over timeTrends over time
Trends over time
 
Multiple Response Questions - Allowing for chance in authentic assessments
Multiple Response Questions - Allowing for chance in authentic assessmentsMultiple Response Questions - Allowing for chance in authentic assessments
Multiple Response Questions - Allowing for chance in authentic assessments
 
Data Management Lab: Data mapping exercise example
Data Management Lab: Data mapping exercise exampleData Management Lab: Data mapping exercise example
Data Management Lab: Data mapping exercise example
 
intro to quantitative
intro to quantitativeintro to quantitative
intro to quantitative
 
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
 

Viewers also liked

Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e Aplicações
Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e AplicaçõesSistemas de Recomendação: Conceitos, Técnicas, Ferramentas e Aplicações
Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e AplicaçõesJonathas Magalhães
 
An Ontology Based Approach for Sharing Distributed Educational
An Ontology Based Approach for Sharing Distributed EducationalAn Ontology Based Approach for Sharing Distributed Educational
An Ontology Based Approach for Sharing Distributed EducationalJonathas Magalhães
 
Composicion de servicios web, un ejemplo
Composicion de servicios web, un ejemploComposicion de servicios web, un ejemplo
Composicion de servicios web, un ejemploJuan Belón Pérez
 
Web Services Composition
Web Services CompositionWeb Services Composition
Web Services Compositioneldorina
 
Building Negotiations Skills
Building Negotiations SkillsBuilding Negotiations Skills
Building Negotiations SkillsAlimakki
 

Viewers also liked (7)

Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e Aplicações
Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e AplicaçõesSistemas de Recomendação: Conceitos, Técnicas, Ferramentas e Aplicações
Sistemas de Recomendação: Conceitos, Técnicas, Ferramentas e Aplicações
 
An Ontology Based Approach for Sharing Distributed Educational
An Ontology Based Approach for Sharing Distributed EducationalAn Ontology Based Approach for Sharing Distributed Educational
An Ontology Based Approach for Sharing Distributed Educational
 
Composicion de servicios web, un ejemplo
Composicion de servicios web, un ejemploComposicion de servicios web, un ejemplo
Composicion de servicios web, un ejemplo
 
Web Services Composition
Web Services CompositionWeb Services Composition
Web Services Composition
 
Managing anger
Managing angerManaging anger
Managing anger
 
Building Negotiations Skills
Building Negotiations SkillsBuilding Negotiations Skills
Building Negotiations Skills
 
Soa & services web
Soa & services webSoa & services web
Soa & services web
 

Similar to Recommending Scientific Papers: Investigating the User Curriculum

General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyStatistics Solutions
 
Metrics Workshop for YOHHLNet
Metrics Workshop for YOHHLNetMetrics Workshop for YOHHLNet
Metrics Workshop for YOHHLNetAlan Fricker
 
How to Design Research from Ilm Ideas on Slide Share
How to Design Research from Ilm Ideas on Slide Share How to Design Research from Ilm Ideas on Slide Share
How to Design Research from Ilm Ideas on Slide Share ilmideas
 
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...ilmideas
 
Glfes summer institute2013_raleigh_final
Glfes summer institute2013_raleigh_finalGlfes summer institute2013_raleigh_final
Glfes summer institute2013_raleigh_finalTricia Townsend
 
Lesson 5a_Surveys and Measurement 2023.pptx
Lesson 5a_Surveys and Measurement 2023.pptxLesson 5a_Surveys and Measurement 2023.pptx
Lesson 5a_Surveys and Measurement 2023.pptxGowshikaSekar
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...CesToronto
 
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...Codemotion
 
Introduction to Systematic Literature Review method
Introduction to Systematic Literature Review methodIntroduction to Systematic Literature Review method
Introduction to Systematic Literature Review methodNorsaremah Salleh
 
Predictive Analytics in Practice
Predictive Analytics in PracticePredictive Analytics in Practice
Predictive Analytics in PracticeHobsons
 
Big Data Analytics - It is here and now!
Big Data Analytics - It is here and now!Big Data Analytics - It is here and now!
Big Data Analytics - It is here and now!Farhan Khan
 
Pg writing aims and proposal
Pg writing aims and proposalPg writing aims and proposal
Pg writing aims and proposalRhianWynWilliams
 
A Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on TwitterA Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on TwitterJonathas Magalhães
 
Quantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsQuantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsMartin Kretzer
 
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppte3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.pptappstore15
 
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docx
ASSIGNMENT 2 - Research Proposal    Weighting 30 tow.docxASSIGNMENT 2 - Research Proposal    Weighting 30 tow.docx
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docxsherni1
 

Similar to Recommending Scientific Papers: Investigating the User Curriculum (20)

General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative Methodology
 
Metrics Workshop for YOHHLNet
Metrics Workshop for YOHHLNetMetrics Workshop for YOHHLNet
Metrics Workshop for YOHHLNet
 
How to Design Research from Ilm Ideas on Slide Share
How to Design Research from Ilm Ideas on Slide Share How to Design Research from Ilm Ideas on Slide Share
How to Design Research from Ilm Ideas on Slide Share
 
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...
How to Develop and Implement Effective Research Tools from Ilm Ideas on Slide...
 
Glfes summer institute2013_raleigh_final
Glfes summer institute2013_raleigh_finalGlfes summer institute2013_raleigh_final
Glfes summer institute2013_raleigh_final
 
Lesson 5a_Surveys and Measurement 2023.pptx
Lesson 5a_Surveys and Measurement 2023.pptxLesson 5a_Surveys and Measurement 2023.pptx
Lesson 5a_Surveys and Measurement 2023.pptx
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
Improving evaluations and utilization with statistical edge nested data desi...
Improving evaluations and utilization with statistical edge  nested data desi...Improving evaluations and utilization with statistical edge  nested data desi...
Improving evaluations and utilization with statistical edge nested data desi...
 
l’outil CASP pour les études qualitatives – webinaire du Club de lecture en l...
l’outil CASP pour les études qualitatives – webinaire du Club de lecture en l...l’outil CASP pour les études qualitatives – webinaire du Club de lecture en l...
l’outil CASP pour les études qualitatives – webinaire du Club de lecture en l...
 
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
 
Introduction to Systematic Literature Review method
Introduction to Systematic Literature Review methodIntroduction to Systematic Literature Review method
Introduction to Systematic Literature Review method
 
Predictive Analytics in Practice
Predictive Analytics in PracticePredictive Analytics in Practice
Predictive Analytics in Practice
 
Big Data Analytics - It is here and now!
Big Data Analytics - It is here and now!Big Data Analytics - It is here and now!
Big Data Analytics - It is here and now!
 
Pg writing aims and proposal
Pg writing aims and proposalPg writing aims and proposal
Pg writing aims and proposal
 
A Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on TwitterA Query Routing Model to Rank Expertcandidates on Twitter
A Query Routing Model to Rank Expertcandidates on Twitter
 
Systematic Literature Review
Systematic Literature ReviewSystematic Literature Review
Systematic Literature Review
 
Quantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsQuantitative Research: Surveys and Experiments
Quantitative Research: Surveys and Experiments
 
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppte3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
e3_chapter__5_evaluation_technics_HCeVpPLCvE.ppt
 
CASP Tool for Qualitative Studies (Sample Answers - September 19 and 27, 2018...
CASP Tool for Qualitative Studies (Sample Answers - September 19 and 27, 2018...CASP Tool for Qualitative Studies (Sample Answers - September 19 and 27, 2018...
CASP Tool for Qualitative Studies (Sample Answers - September 19 and 27, 2018...
 
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docx
ASSIGNMENT 2 - Research Proposal    Weighting 30 tow.docxASSIGNMENT 2 - Research Proposal    Weighting 30 tow.docx
ASSIGNMENT 2 - Research Proposal Weighting 30 tow.docx
 

More from Jonathas Magalhães

Enhancing the Status Message Question Asking Process on Facebook
Enhancing the Status Message Question Asking Process on FacebookEnhancing the Status Message Question Asking Process on Facebook
Enhancing the Status Message Question Asking Process on FacebookJonathas Magalhães
 
A Recommender System for Predicting User Engagement in Twitter
A Recommender System for Predicting User Engagement in TwitterA Recommender System for Predicting User Engagement in Twitter
A Recommender System for Predicting User Engagement in TwitterJonathas Magalhães
 
Social Query: A Query Routing System for Twitter
Social Query: A Query Routing System for TwitterSocial Query: A Query Routing System for Twitter
Social Query: A Query Routing System for TwitterJonathas Magalhães
 
Predicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing AlgorithmPredicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing AlgorithmJonathas Magalhães
 
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...Jonathas Magalhães
 
Improving a Recommender System Through Integration of User Profiles: a Semant...
Improving a Recommender System Through Integration of User Profiles: a Semant...Improving a Recommender System Through Integration of User Profiles: a Semant...
Improving a Recommender System Through Integration of User Profiles: a Semant...Jonathas Magalhães
 

More from Jonathas Magalhães (10)

Enhancing the Status Message Question Asking Process on Facebook
Enhancing the Status Message Question Asking Process on FacebookEnhancing the Status Message Question Asking Process on Facebook
Enhancing the Status Message Question Asking Process on Facebook
 
Redes Bayesianas
Redes BayesianasRedes Bayesianas
Redes Bayesianas
 
Probabilidade
ProbabilidadeProbabilidade
Probabilidade
 
A Recommender System for Predicting User Engagement in Twitter
A Recommender System for Predicting User Engagement in TwitterA Recommender System for Predicting User Engagement in Twitter
A Recommender System for Predicting User Engagement in Twitter
 
Social Query: A Query Routing System for Twitter
Social Query: A Query Routing System for TwitterSocial Query: A Query Routing System for Twitter
Social Query: A Query Routing System for Twitter
 
K-Nearest Neighbor
K-Nearest NeighborK-Nearest Neighbor
K-Nearest Neighbor
 
Naive Bayes
Naive BayesNaive Bayes
Naive Bayes
 
Predicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing AlgorithmPredicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing Algorithm
 
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...
An Open and Inspectable Learner Modeling with a Negotiation Mechanism to Solv...
 
Improving a Recommender System Through Integration of User Profiles: a Semant...
Improving a Recommender System Through Integration of User Profiles: a Semant...Improving a Recommender System Through Integration of User Profiles: a Semant...
Improving a Recommender System Through Integration of User Profiles: a Semant...
 

Recently uploaded

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...Product School
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Thierry Lestable
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Product School
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsVlad Stirbu
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...Product School
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsExpeed Software
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 

Recently uploaded (20)

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 

Recommending Scientific Papers: Investigating the User Curriculum

  • 1. Recommending Scientific Papers: Investigating the User Curriculum Jonathas Magalhães, Cleyton Souza, Evandro Costa, Joseana Fechine
  • 2. Warm up! • How does research in Brazil work? • What is Lattes? • Why is Lattes a big deal in Brazil?
  • 3. Lattes = Opportunity • The information available in Lattes creates a great opportunity to recommend science related content to researchers in Brazil… – Projects – Contributors – Call for Papers – Papers • … and to test different algorithtms!
  • 4. Our Work • In this paper: – (1) We present a Personalized Paper Recommender System • A user-paper approach that takes into consideration the Lattes information. – (2) We test different profiling strategies – (3) We test how much older information is necessary in order to provide better recommendations. – (4) We compare our strategy with state of art
  • 5. Recommendation Algorithm • 𝑠𝑖𝑚 𝑢𝑠𝑒𝑟 𝑝𝑟𝑜𝑓𝑖𝑙𝑒, 𝑝𝑎𝑝𝑒𝑟 – Cosine similarity • Age weighting 1 − Δ𝑦 Δ𝑣
  • 6. Profiling Strategies • Concepts Profile –The vector is composed of predefined concepts. • Terms Profile –The vector is composed of the set of terms that compose the dictionary.
  • 7. Research Questions • 𝑄1: How many years of the user curriculum are necessary to use in order to provide great recommendations? • 𝑄2: Is there any difference between the concepts profile and terms profile? • 𝑄3: Is Lopes’s algorithm better than them? • 𝑄4: Which method should we choose?
  • 8. Evaluation • To answer our research questions, we conducted a user study experiment • We developed a system to collect user’s impression about a set of papers
  • 9. Evaluation • We used user’s impressions to ranking the papers and compared with the outcome of the recommendation algorithms to answer our research questions
  • 10. Results The NDCG@5, NDCG@10 and length means of the methods of generated profiles. We execute Shapiro-Wilk test to verify the data normality. The symbol (*) indicates that the data is not normally distributed, i.e., 𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.05
  • 11. Results Results of hypothesis testes performed to compare the strategis. Both tests are performed with parameters 𝛼 = 0.05, alternative = “greater”, paired=TRUE (>> and > denote significance levels pof 𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.01 and 𝑝 − 𝑣𝑎𝑙𝑢𝑒 < 0.05, respectively).
  • 12. 𝑄1: How many years of the user curriculum are necessary to use in order to provide great recommendations? Answer: It depends of the profiling strategy: four years for TP and five years for CP, apparently.
  • 13. 𝑄2: Is there any difference between the concepts profile and terms profile? Answer: Comparing the terms profile TP4 with concepts profile CP5, we verify a not statically proved superiority. Thus, there is no difference.
  • 14. 𝑄3: Is Lopes’s algorithm better than them? Answer: Yes, both approaches (TP4 and CP5) achieved statiscaly better performance than Lopes.
  • 15. 𝑄4: Which method should we choose? Answer: It depends on the context, because there is a trade-off between the techniques. If the system needs an online recommendation with reasonable quality, the CP profiles are the best choice. On the other hand, if the systems can compute the recommendations offline, and the time consuming is not a problem, the T P is better.
  • 16. Conclusion and Future Work • We presented and evaluated our approach to a paper Recommender System that considers the user curriculum crawled from the CV-Lattes • Our main contributions are: – Our algorithms achieved better performance than state-of-art paper recommendation algorithm dealing with Lattes – We observed no statistical difference between both profiling strategies. – We build a dataset that can be used for future research in the are
  • 17. Conclusion and Future Work • Our planning: – To confront our results with data from others CV- oriented networks – To work in a integration algorithm to combine data from multiple sources – To improve the recommendation model using paper related information