SlideShare a Scribd company logo
Fajie Yuan (Westlake University, Tencent), Guoxiao Zhang(Tencent ),
Alexandros Karatzoglou (Google Research), Joemon Jose (University of Glasgow),
Beibei Kong (Tencent), Yudong Li(Tencent)
One Person, One Model, One World:
Learning Continual User Representation
without Forgetting
SIGIR2021
Data&Code: https://github.com/fajieyuan/SIGIR2021_Conure
 Motivation
 Related Work
 Conure
 Experiments
Outline
A person has different roles to play in life!
But all these roles may have some
commonalities, such as personalization,
habits, preference.
Our Motivation
One Person, One Model, One World
Our Focus:
Whether we can build a user representation model that could
keep learning throughout all sequential tasks without forgetting
Car Rec
News Rec
Music Rec
Video Rec Video Rec
Social APP
Social APP
Video Rec
Search
Engine
Browser
Map
APP
Video Rec
A person has different roles to
play in life!But all these roles
may have some commonalities,
such as personalization,
habits, preference.
TikTok -- warm user Amazon —cold users
No interaction
Ads --- new users
Clicking logs
Keypoints
• Necessity and possibility why lifelong learning for UR learning?
• Lifelong learning paradigm throughout all tasks.
• Performance gain for tasks have certain correlations.
Using Lifelong learning techniques to solve recommendation tasks
 Motivation
 Related Work
 Conure
 Experiments
Outline
• Classical UR models (works well but is specific to only one task)
SASRec(Kang et al ICDM2018)
NextItNet (Yuan et al WSDM2019)
DSSM(Huang et al CIKM2013) Grec (Yuan et al WWW2020 )
GRU4Rec (Hidasi et al ICLR2016)
• PeterRec (Two-stage Transfer Learning):
• PeterRec (Finetuning):
• Transfer Learning Paradigm Comparisons:
(a) Standard TF (b) PeterRec (b) Conure (b) MTL
Lifelong learning without parameter preserving
 Motivation
 Related Work
 Conure
 Experiments
Outline
• Catastrophic Forgetting :
Parameter
Changes
Last hidden
Vector Changes
• Over-parameterization:
—(1) the more parameters are pruned, the worse it performs
—(2) performing retraining on the pruned network (i.e., “pr70+retrain”) regains its original accuracy quickly
—(3) smaller models (i.e., (b))are also highly over-parameterized
Conure is conceptually very
simple, easy to implement,
and applicable to various
sequential encoder networks.
• Conure architecture and learning process.
 Motivation
 Related Work
 Conure
 Experiments
Outline
• Datasets:
TTL: https://drive.google.com/file/d/1imhHUsivh6oMEtEW-RwVc4OsDqn-xOaP/view
ML: https://drive.google.com/file/d/1-_KmnZFaOdH11keLYVcgkf-kW_BaM266/view
• Results:
—(1) Conure largely outperforms other models on T3 because of the positive transfer from T1 and T2
—(2) Conure, PeterRec and FineAll largely outperforms SimMo because of of the positive transfer from T1
—(3) SinMoAll performs much worse on most tasks (except the last one) because of catastrophic forgetting
• Ablation study- T2 for T3:
—(1) Without training T2,Conure shows worse results,e.g., -6.5% on TTL20%
• Ablation study- Task order:
—(1) Conure is not sensitive to the task order.
• Ablation study:
—(1) pruning also works for the embedding layer
—(1) Conure is not restricted to specialized
sequential encoder.
—(2) Conure with the Transformer backbone works
a bit better than it with NextItNet.
Contributions:
—(1)providing the first lifelong learning paradigm for user representations.
—(2)providing insights for forgetting and redundancy issues in user representation models
—(3)designing Conure, the first lifelong learning algorithm - smple and easy to implement
—(4)instantiazing Conure with NextItNet and Transformer backbones
—(5)Extensive experiments with SOTA performance with many new discoveries and insights
Case study:
One Person, One Model, One World:  Learning Continual User Representation  without Forgetting SIGIR2021
One Person, One Model, One World:  Learning Continual User Representation  without Forgetting SIGIR2021
One Person, One Model, One World:  Learning Continual User Representation  without Forgetting SIGIR2021

More Related Content

Similar to One Person, One Model, One World: Learning Continual User Representation without Forgetting SIGIR2021

Robot simulation for_control_design
Robot simulation for_control_designRobot simulation for_control_design
Robot simulation for_control_design
AlFakir Fikri AlTakiri
 
IRJET - Human Pose Detection using Deep Learning
IRJET - Human Pose Detection using Deep LearningIRJET - Human Pose Detection using Deep Learning
IRJET - Human Pose Detection using Deep Learning
IRJET Journal
 
Human pose detection using machine learning by Grandel
Human pose detection using machine learning by GrandelHuman pose detection using machine learning by Grandel
Human pose detection using machine learning by Grandel
GrandelDsouza
 
It Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
It Does What You Say, Not What You Mean: Lessons From A Decade of Program RepairIt Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
It Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
Claire Le Goues
 
Ch01lect1 et
Ch01lect1 etCh01lect1 et
Ch01lect1 et
AhmedHassanHaji1
 
Best Practices - Software Engineering
Best Practices - Software EngineeringBest Practices - Software Engineering
Best Practices - Software Engineering
3Quill Softwares
 
Basics of modeling and simulation
Basics of modeling and simulationBasics of modeling and simulation
Basics of modeling and simulation
Serdar Kaya
 
Introduction to simulation and modeling
Introduction to simulation and modelingIntroduction to simulation and modeling
Introduction to simulation and modeling
antim19
 
Wanna see your open source project succeed? - Nurture your community
Wanna see your open source project succeed? - Nurture your communityWanna see your open source project succeed? - Nurture your community
Wanna see your open source project succeed? - Nurture your community
Jordi Cabot
 
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit ShahANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
AgileNetwork
 
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOTA Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
IRJET Journal
 
Real Time Processing Using Twitter Heron by Karthik Ramasamy
Real Time Processing Using Twitter Heron by Karthik RamasamyReal Time Processing Using Twitter Heron by Karthik Ramasamy
Real Time Processing Using Twitter Heron by Karthik Ramasamy
Data Con LA
 
Software Analytics = Sharing Information
Software Analytics = Sharing InformationSoftware Analytics = Sharing Information
Software Analytics = Sharing Information
Thomas Zimmermann
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald
 
Simulation and modeling introduction.pptx
Simulation and modeling introduction.pptxSimulation and modeling introduction.pptx
Simulation and modeling introduction.pptx
ShamasRehman4
 
Testing In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the CloudTesting In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the Cloud
SOASTA
 
The Grand Unified Theory of Autonomous Systems, Humans and Simulation
The Grand Unified Theory of Autonomous Systems, Humans and SimulationThe Grand Unified Theory of Autonomous Systems, Humans and Simulation
The Grand Unified Theory of Autonomous Systems, Humans and Simulation
Andy Fawkes
 
Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
Functional Genomics Data Society
 
Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)
Aysylu Greenberg
 
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
ssuser610732
 

Similar to One Person, One Model, One World: Learning Continual User Representation without Forgetting SIGIR2021 (20)

Robot simulation for_control_design
Robot simulation for_control_designRobot simulation for_control_design
Robot simulation for_control_design
 
IRJET - Human Pose Detection using Deep Learning
IRJET - Human Pose Detection using Deep LearningIRJET - Human Pose Detection using Deep Learning
IRJET - Human Pose Detection using Deep Learning
 
Human pose detection using machine learning by Grandel
Human pose detection using machine learning by GrandelHuman pose detection using machine learning by Grandel
Human pose detection using machine learning by Grandel
 
It Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
It Does What You Say, Not What You Mean: Lessons From A Decade of Program RepairIt Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
It Does What You Say, Not What You Mean: Lessons From A Decade of Program Repair
 
Ch01lect1 et
Ch01lect1 etCh01lect1 et
Ch01lect1 et
 
Best Practices - Software Engineering
Best Practices - Software EngineeringBest Practices - Software Engineering
Best Practices - Software Engineering
 
Basics of modeling and simulation
Basics of modeling and simulationBasics of modeling and simulation
Basics of modeling and simulation
 
Introduction to simulation and modeling
Introduction to simulation and modelingIntroduction to simulation and modeling
Introduction to simulation and modeling
 
Wanna see your open source project succeed? - Nurture your community
Wanna see your open source project succeed? - Nurture your communityWanna see your open source project succeed? - Nurture your community
Wanna see your open source project succeed? - Nurture your community
 
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit ShahANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
 
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOTA Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
A Research Paper on HUMAN MACHINE CONVERSATION USING CHATBOT
 
Real Time Processing Using Twitter Heron by Karthik Ramasamy
Real Time Processing Using Twitter Heron by Karthik RamasamyReal Time Processing Using Twitter Heron by Karthik Ramasamy
Real Time Processing Using Twitter Heron by Karthik Ramasamy
 
Software Analytics = Sharing Information
Software Analytics = Sharing InformationSoftware Analytics = Sharing Information
Software Analytics = Sharing Information
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
 
Simulation and modeling introduction.pptx
Simulation and modeling introduction.pptxSimulation and modeling introduction.pptx
Simulation and modeling introduction.pptx
 
Testing In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the CloudTesting In Production (TiP) Advances with Big Data and the Cloud
Testing In Production (TiP) Advances with Big Data and the Cloud
 
The Grand Unified Theory of Autonomous Systems, Humans and Simulation
The Grand Unified Theory of Autonomous Systems, Humans and SimulationThe Grand Unified Theory of Autonomous Systems, Humans and Simulation
The Grand Unified Theory of Autonomous Systems, Humans and Simulation
 
Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
 
Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)
 
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation ...
 

Recently uploaded

Call Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
Call Girls Lucknow 0000000000 Independent Call Girl Service LucknowCall Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
Call Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
hiju9823
 
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
eudsoh
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
Senior Software Profiles Backend Sample - Sheet1.pdf
Senior Software Profiles  Backend Sample - Sheet1.pdfSenior Software Profiles  Backend Sample - Sheet1.pdf
Senior Software Profiles Backend Sample - Sheet1.pdf
Vineet
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
keesa2
 
Sid Sigma educational and problem solving power point- Six Sigma.ppt
Sid Sigma educational and problem solving power point- Six Sigma.pptSid Sigma educational and problem solving power point- Six Sigma.ppt
Sid Sigma educational and problem solving power point- Six Sigma.ppt
ArshadAyub49
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
perranet1
 
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdfNamma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
22ad0301
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Marlon Dumas
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
uevausa
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
Vineet
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
zsafxbf
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
davidpietrzykowski1
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
nyvan3
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
blueshagoo1
 
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
9gr6pty
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
nhutnguyen355078
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
Bisnar Chase Personal Injury Attorneys
 

Recently uploaded (20)

Call Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
Call Girls Lucknow 0000000000 Independent Call Girl Service LucknowCall Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
Call Girls Lucknow 0000000000 Independent Call Girl Service Lucknow
 
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
 
Senior Software Profiles Backend Sample - Sheet1.pdf
Senior Software Profiles  Backend Sample - Sheet1.pdfSenior Software Profiles  Backend Sample - Sheet1.pdf
Senior Software Profiles Backend Sample - Sheet1.pdf
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
 
Sid Sigma educational and problem solving power point- Six Sigma.ppt
Sid Sigma educational and problem solving power point- Six Sigma.pptSid Sigma educational and problem solving power point- Six Sigma.ppt
Sid Sigma educational and problem solving power point- Six Sigma.ppt
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
 
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdfNamma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
 
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
一比一原版(uob毕业证书)伯明翰大学毕业证如何办理
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
 

One Person, One Model, One World: Learning Continual User Representation without Forgetting SIGIR2021

  • 1. Fajie Yuan (Westlake University, Tencent), Guoxiao Zhang(Tencent ), Alexandros Karatzoglou (Google Research), Joemon Jose (University of Glasgow), Beibei Kong (Tencent), Yudong Li(Tencent) One Person, One Model, One World: Learning Continual User Representation without Forgetting SIGIR2021 Data&Code: https://github.com/fajieyuan/SIGIR2021_Conure
  • 2.  Motivation  Related Work  Conure  Experiments Outline
  • 3. A person has different roles to play in life! But all these roles may have some commonalities, such as personalization, habits, preference. Our Motivation One Person, One Model, One World Our Focus: Whether we can build a user representation model that could keep learning throughout all sequential tasks without forgetting
  • 4. Car Rec News Rec Music Rec Video Rec Video Rec Social APP Social APP Video Rec Search Engine Browser Map APP Video Rec A person has different roles to play in life!But all these roles may have some commonalities, such as personalization, habits, preference.
  • 5. TikTok -- warm user Amazon —cold users No interaction Ads --- new users Clicking logs
  • 6. Keypoints • Necessity and possibility why lifelong learning for UR learning? • Lifelong learning paradigm throughout all tasks. • Performance gain for tasks have certain correlations. Using Lifelong learning techniques to solve recommendation tasks
  • 7.  Motivation  Related Work  Conure  Experiments Outline
  • 8. • Classical UR models (works well but is specific to only one task) SASRec(Kang et al ICDM2018) NextItNet (Yuan et al WSDM2019) DSSM(Huang et al CIKM2013) Grec (Yuan et al WWW2020 ) GRU4Rec (Hidasi et al ICLR2016)
  • 9. • PeterRec (Two-stage Transfer Learning):
  • 11. • Transfer Learning Paradigm Comparisons: (a) Standard TF (b) PeterRec (b) Conure (b) MTL Lifelong learning without parameter preserving
  • 12.  Motivation  Related Work  Conure  Experiments Outline
  • 13. • Catastrophic Forgetting : Parameter Changes Last hidden Vector Changes
  • 14. • Over-parameterization: —(1) the more parameters are pruned, the worse it performs —(2) performing retraining on the pruned network (i.e., “pr70+retrain”) regains its original accuracy quickly —(3) smaller models (i.e., (b))are also highly over-parameterized
  • 15. Conure is conceptually very simple, easy to implement, and applicable to various sequential encoder networks. • Conure architecture and learning process.
  • 16.  Motivation  Related Work  Conure  Experiments Outline
  • 17. • Datasets: TTL: https://drive.google.com/file/d/1imhHUsivh6oMEtEW-RwVc4OsDqn-xOaP/view ML: https://drive.google.com/file/d/1-_KmnZFaOdH11keLYVcgkf-kW_BaM266/view
  • 18. • Results: —(1) Conure largely outperforms other models on T3 because of the positive transfer from T1 and T2 —(2) Conure, PeterRec and FineAll largely outperforms SimMo because of of the positive transfer from T1 —(3) SinMoAll performs much worse on most tasks (except the last one) because of catastrophic forgetting
  • 19. • Ablation study- T2 for T3: —(1) Without training T2,Conure shows worse results,e.g., -6.5% on TTL20%
  • 20. • Ablation study- Task order: —(1) Conure is not sensitive to the task order.
  • 21. • Ablation study: —(1) pruning also works for the embedding layer —(1) Conure is not restricted to specialized sequential encoder. —(2) Conure with the Transformer backbone works a bit better than it with NextItNet.
  • 22. Contributions: —(1)providing the first lifelong learning paradigm for user representations. —(2)providing insights for forgetting and redundancy issues in user representation models —(3)designing Conure, the first lifelong learning algorithm - smple and easy to implement —(4)instantiazing Conure with NextItNet and Transformer backbones —(5)Extensive experiments with SOTA performance with many new discoveries and insights
  • 23.