Recommending Scientific Papers:
Investigating the User Curriculum
Jonathas Magalhães, Cleyton Souza,
Evandro Costa, Joseana Fechine
Warm up!
• How does research in Brazil work?
• What is Lattes?
• Why is Lattes a big deal in Brazil?
Lattes = Opportunity
• The information available in Lattes creates a
great opportunity to recommend science
related content to researchers in Brazil…
– Projects
– Contributors
– Call for Papers
– Papers
• … and to test different algorithms!
Our Work
• In this paper:
– (1) We present a Personalized Paper
Recommender System
• A user-paper approach that takes into consideration
the Lattes information.
– (2) We test different profiling strategies
– (3) We test how much older information is
necessary in order to provide better
recommendations.
– (4) We compare our strategy with the state of the art
Recommendation Algorithm
• $\mathrm{sim}(\text{user profile}, \text{paper})$
– Cosine similarity
• Age weighting: $1 - \frac{\Delta y}{\Delta v}$
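The slide gives only the two ingredients, so the following is a minimal sketch of how they might fit together, not the authors' implementation. It assumes that Δy is the age in years of a curriculum item, that Δv is the length in years of the window being considered, and that the weight scales each item's term vector when the user profile is built. The names `cosine`, `age_weight`, and `build_profile`, and the clamping of negative weights to zero, are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty vector is similar to nothing
    return dot / (norm_u * norm_v)


def age_weight(delta_y, delta_v):
    """Weight 1 - delta_y / delta_v; clamping at 0 is an assumption."""
    return max(0.0, 1.0 - delta_y / delta_v)


def build_profile(publications, current_year, window):
    """Sum the age-weighted term vectors of a user's curriculum items.

    publications: list of (year, {term: weight}) pairs (assumed layout).
    """
    profile = {}
    for year, terms in publications:
        w = age_weight(current_year - year, window)
        for term, x in terms.items():
            profile[term] = profile.get(term, 0.0) + w * x
    return profile
```

A candidate paper would then be scored with `cosine(build_profile(...), paper_vector)`, so older curriculum items contribute less to the score.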
Profiling Strategies
• Concepts Profile
– The vector is composed of predefined
concepts.
• Terms Profile
– The vector is composed of the set of
terms that compose the dictionary.
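As a rough illustration of the difference between the two vectors, the sketch below builds both from the same texts. It assumes raw term counts as weights and a hand-written mapping from each concept to a few trigger terms; the slides specify neither, so `tokenize`, `terms_profile`, `concepts_profile`, and the `concept_terms` mapping are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def terms_profile(texts, dictionary):
    """One dimension per dictionary term, weighted by raw count."""
    counts = Counter(tok for text in texts for tok in tokenize(text))
    return [counts[term] for term in dictionary]


def concepts_profile(texts, concept_terms):
    """One dimension per predefined concept.

    concept_terms maps a concept name to the terms that signal it;
    this mapping is an assumption made for the example.
    """
    counts = Counter(tok for text in texts for tok in tokenize(text))
    return [sum(counts[t] for t in terms) for terms in concept_terms.values()]
```

The concepts vector has as many dimensions as there are concepts, while the terms vector grows with the dictionary, so the two can differ greatly in length.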
Research Questions
• 𝑄1: How many years of the user's curriculum
are needed to provide good
recommendations?
• 𝑄2: Is there any difference between the
concepts profile and terms profile?
• 𝑄3: Is Lopes’s algorithm better than them?
• 𝑄4: Which method should we choose?
Evaluation
• To answer our research questions, we
conducted a user study
• We developed a system to collect users'
impressions of a set of papers
Evaluation
• We used the users' impressions to rank the
papers and compared that ranking with the output of
the recommendation algorithms to answer
our research questions
Results
Mean NDCG@5, NDCG@10, and length for the profile-generation
methods. We ran the Shapiro-Wilk test to check data normality. The
symbol (*) indicates that the data are not normally distributed, i.e.,
p-value < 0.05
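For readers unfamiliar with the metric, here is a minimal sketch of NDCG@k. It assumes the common formulation with linear gain and a log2 position discount, and it takes the ideal ranking from the same list of rated papers; the slides do not say which variant was used. The function names are hypothetical.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k positions."""
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k):
    """DCG@k divided by the DCG@k of the ideal (descending) ordering.

    relevances: user ratings of the papers, in the order the
    algorithm ranked them.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal == 0:
        return 0.0  # no relevant paper at all
    return dcg_at_k(relevances, k) / ideal
```

A ranking that already lists the papers in descending order of user rating scores 1.0, and every misplaced relevant paper lowers the value.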
Results
Results of the hypothesis tests performed to compare the strategies. Both tests are
performed with parameters α = 0.05, alternative = “greater”, paired=TRUE
(>> and > denote significance levels of p-value < 0.01 and p-value < 0.05,
respectively).
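The slide does not name the two tests, so the sketch below is a stand-in rather than the authors' procedure: an exact paired sign-flip permutation test with a one-sided "greater" alternative. It only illustrates what a paired, one-sided comparison of two strategies computes. The function name is hypothetical, and the enumeration is exponential in the number of pairs, so it suits small samples only.

```python
from itertools import product


def paired_permutation_pvalue(a, b):
    """Exact one-sided p-value for H1: mean(a - b) > 0 on paired samples.

    Under H0 the sign of each paired difference is arbitrary, so every
    sign assignment is enumerated and those whose summed difference is
    at least as large as the observed one are counted.
    """
    diffs = [x - y for x, y in zip(a, b)]
    observed = sum(diffs)
    hits = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        # small tolerance guards against floating-point ties
        if sum(s * d for s, d in zip(signs, diffs)) >= observed - 1e-12:
            hits += 1
    return hits / total
```

With per-user NDCG scores for two strategies as `a` and `b`, a returned value below 0.05 would correspond to the `>` marker in the caption above, and a value below 0.01 to `>>`.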
𝑄1: How many years of the user's curriculum are
needed to provide good
recommendations?
Answer: It depends on the profiling strategy: apparently four
years for TP and five years for CP.
𝑄2: Is there any difference between the
concepts profile and terms profile?
Answer: Comparing the terms profile TP4 with the
concepts profile CP5, the observed superiority is not
statistically significant. Thus, we find no difference.
𝑄3: Is Lopes’s algorithm better than them?
Answer: No, both approaches (TP4 and CP5)
achieved statistically better performance than
Lopes.
𝑄4: Which method should we choose?
Answer: It depends on the context, because there is a
trade-off between the techniques. If the system needs
online recommendations of reasonable quality, the CP
profiles are the best choice. On the other hand, if the
system can compute the recommendations offline and
computation time is not a problem, the TP is better.
Conclusion and Future Work
• We presented and evaluated our approach to a
paper Recommender System that considers the
user curriculum crawled from the CV-Lattes
• Our main contributions are:
– Our algorithms achieved better performance than
a state-of-the-art paper recommendation algorithm that works
with Lattes
– We observed no statistical difference between both
profiling strategies.
– We built a dataset that can be used for future
research in the area
Conclusion and Future Work
• Our plans:
– To compare our results with data from other
CV-oriented networks
– To work on an integration algorithm to combine
data from multiple sources
– To improve the recommendation model using
paper-related information

Videogame localization & technology_ how to enhance the power of translation.pdf
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URL
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBX
 

Recommending Scientific Papers: Investigating the User Curriculum

  • 1. Recommending Scientific Papers: Investigating the User Curriculum Jonathas Magalhães, Cleyton Souza, Evandro Costa, Joseana Fechine
  • 2. Warm up! • How does research in Brazil work? • What is Lattes? • Why is Lattes a big deal in Brazil?
  • 3. Lattes = Opportunity • The information available in Lattes creates a great opportunity to recommend science-related content to researchers in Brazil… – Projects – Contributors – Calls for Papers – Papers • … and to test different algorithms!
  • 4. Our Work • In this paper: – (1) We present a Personalized Paper Recommender System • A user-paper approach that takes the Lattes information into consideration. – (2) We test different profiling strategies – (3) We test how much older information is necessary in order to provide better recommendations. – (4) We compare our strategy with the state of the art
  • 5. Recommendation Algorithm • 𝑠𝑖𝑚(𝑢𝑠𝑒𝑟 𝑝𝑟𝑜𝑓𝑖𝑙𝑒, 𝑝𝑎𝑝𝑒𝑟) – Cosine similarity • Age weighting: 1 − Δ𝑦/Δ𝑣
  • 6. Profiling Strategies • Concepts Profile –The vector is composed of predefined concepts. • Terms Profile –The vector is composed of the set of terms that compose the dictionary.
  • 7. Research Questions • 𝑄1: How many years of the user curriculum are necessary to use in order to provide great recommendations? • 𝑄2: Is there any difference between the concepts profile and terms profile? • 𝑄3: Is Lopes’s algorithm better than them? • 𝑄4: Which method should we choose?
  • 8. Evaluation • To answer our research questions, we conducted a user study • We developed a system to collect users’ impressions of a set of papers
  • 9. Evaluation • We used the users’ impressions to rank the papers and compared that ranking with the output of the recommendation algorithms to answer our research questions
  • 10. Results Mean NDCG@5, NDCG@10, and length for each profile-generation method. We ran the Shapiro-Wilk test to check the normality of the data. The symbol (*) indicates that the data are not normally distributed, i.e., p-value < 0.05
  • 11. Results Results of the hypothesis tests performed to compare the strategies. Both tests are performed with parameters 𝛼 = 0.05, alternative = “greater”, paired=TRUE (>> and > denote significance at p-value < 0.01 and p-value < 0.05, respectively).
  • 12. 𝑄1: How many years of the user curriculum are necessary to use in order to provide great recommendations? Answer: It depends on the profiling strategy: apparently four years for TP and five years for CP.
  • 13. 𝑄2: Is there any difference between the concepts profile and terms profile? Answer: Comparing the terms profile TP4 with the concepts profile CP5, we observe a superiority that is not statistically significant. Thus, we find no difference.
  • 14. 𝑄3: Is Lopes’s algorithm better than them? Answer: No, both approaches (TP4 and CP5) achieved statistically better performance than Lopes.
  • 15. 𝑄4: Which method should we choose? Answer: It depends on the context, because there is a trade-off between the techniques. If the system needs online recommendations of reasonable quality, the CP profiles are the best choice. On the other hand, if the system can compute the recommendations offline and computation time is not a problem, TP is better.
  • 16. Conclusion and Future Work • We presented and evaluated our approach to a paper recommender system that considers the user curriculum crawled from CV-Lattes • Our main contributions are: – Our algorithms achieved better performance than a state-of-the-art paper recommendation algorithm dealing with Lattes – We observed no statistically significant difference between the two profiling strategies. – We built a dataset that can be used for future research in the area
  • 17. Conclusion and Future Work • Our plans: – To compare our results with data from other CV-oriented networks – To work on an integration algorithm to combine data from multiple sources – To improve the recommendation model using paper-related information
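
The recommendation step (slide 5) and the NDCG@k evaluation (slide 10) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several points are assumptions, since the slides do not define them: Δ𝑦 is read as the age in years of a curriculum item, Δ𝑣 as the number of curriculum years considered, the age weight is applied to each item's terms before they are summed into the profile, and NDCG uses linear gain with a log2 discount. All function names are hypothetical.

```python
import math
from collections import Counter


def age_weight(item_year, current_year, window_years):
    """Age weighting 1 - dy/dv (assumed reading: dy = item age, dv = window length)."""
    delta_y = current_year - item_year
    # Items older than the window get weight 0 instead of a negative weight.
    return max(0.0, 1.0 - delta_y / window_years)


def build_profile(items, current_year, window_years):
    """Build an age-weighted term vector from (year, terms) curriculum items."""
    profile = Counter()
    for year, terms in items:
        w = age_weight(year, current_year, window_years)
        for term in terms:
            profile[term] += w
    return profile


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def ndcg_at_k(relevances, k):
    """NDCG@k for user ratings listed in the order the recommender ranked the papers."""
    def dcg(rels):
        return sum(r / math.log2(i + 2) for i, r in enumerate(rels[:k]))

    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


# Hypothetical data: two curriculum items and two candidate papers.
curriculum = [(2014, ["recommender", "lattes"]), (2012, ["ontology"])]
profile = build_profile(curriculum, current_year=2014, window_years=4)
papers = {
    "p1": Counter(["recommender", "lattes"]),
    "p2": Counter(["ontology", "compiler"]),
}
ranking = sorted(papers, key=lambda p: cosine(profile, papers[p]), reverse=True)
```

In this example the more recent curriculum terms dominate the profile, so `p1` is ranked ahead of `p2`. Feeding the users' ratings for the ranked papers into `ndcg_at_k` gives the kind of score reported in the results slides.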