SlideShare a Scribd company logo
1 of 30
eduworks-network.eu 
facebook.com/eduworksnetwork 
@EduworksNetwork 
Discovering Emerging Research 
Topic and Trends in W&O 
Psychology by Text Mining 
Scientific Articles 
Vladimer Kobayashi, Stefan T Mol, Ph.D., Gábor 
Kismihók, Ph.D.
Contents 
• Background – Why this study? Hasn’t this been done before? 
• Objectives – What are we really trying to do here? 
• Materials – The ingredients 
• Methods – The tools 
• Results – Show me the outcome! 
• Conclusion and Future Work – What has been achieved? How 
to proceed 
Kobayashi, Mol, & Kismihók - University of Amsterdam 2
Kobayashi, Mol, & Kismihók - University of Amsterdam 3
Background 
• The psychological literature is huge (PsychINFO abstracts 3.7 million 
documents and PubPsych has 900,000 searchable records) 
• Text Mining applications 
• Mining biomedical literature 
• Web textual data 
• Opinion and Sentiment mining from product reviews, microblogging, users’ posts and 
comments. 
• Text Mining opportunities for gaining insight into trends in the scientific 
literature 
• Key term extraction to support efficient document search and retrieval 
• Identifying topics to group document with similar themes 
• So far little text mining effort has been made in the W&O psychology 
Literature 
Kobayashi, Mol, & Kismihók - University of Amsterdam 4
Kobayashi, Mol, & Kismihók - University of Amsterdam 5
Objectives 
• Apply text mining, specifically, topic modeling techniques to the 
W&O literature 
• Pair topics and publication dates to reveal topical trends in this 
field 
Contributions 
• Efficient search and retrieval of W&O psychology literature 
• Supporting systematic literature review and automatic 
knowledge discovery 
• Identifying topics (or themes) and topic trends 
Kobayashi, Mol, & Kismihók - University of Amsterdam 6
Kobayashi, Mol, & Kismihók - University of Amsterdam 7
Terminology 
• Document – a file that contains sequence of characters or text 
• Corpus – collection of documents 
• Term – smallest unit in a document (e.g. word, phrase, 
sentence, or even a single character) 
• Vocabulary or lexicon – set of all unique terms 
Kobayashi, Mol, & Kismihók - University of Amsterdam 8
Kobayashi, Mol, & Kismihók - University of Amsterdam 9
SOURCE 
• Abstracts from 4 journals 
1975-2014 
1096 abstracts 
2008-2014 
89 abstracts 
1977-2014 
1115 abstracts 
1991-2014 
602 abstracts 
Total number of abstracts: 2902 
Kobayashi, Mol, & Kismihók - University of Amsterdam 10
For this study… 
• DOCUMENT 
• A single abstract 
• CORPUS 
• Collection of abstracts 
• TERMS 
• Words 
• VOCABULARY 
• Set of all unique words (after preprocessing) in the corpus 
Kobayashi, Mol, & Kismihók - University of Amsterdam 11
Why Abstracts only? 
• The abstract contains the gist of the whole article 
• Commonly, articles are indexed based on titles, keywords and 
abstracts. 
Kobayashi, Mol, & Kismihók - University of Amsterdam 12
Kobayashi, Mol, & Kismihók - University of Amsterdam 13
Techniques 
• String Processing 
• Natural Language Processing 
• Topic Modeling 
• Latent Dirichlet Allocation Model 
• Assumes that each document is a mixture of topics 
• Each word is generated from a specific topic 
• An algorithm for topic discovery 
• Topical Trend Analysis 
Kobayashi, Mol, & Kismihók - University of Amsterdam 14
Analysis done separately for each journal 
Kobayashi, Mol, & Kismihók - University of Amsterdam 15
Original abstract 
Preprocessed abstract 
 Lower case transformation 
 Stopwords removal 
 Delete punctuations 
 Stemming 
Kobayashi, Mol, & Kismihók - University of Amsterdam 16
Abstracts 
Vocabulary The document-by-term 
matrix 
a a 
  
11 1 
N 
  
  
 a a 
V 1 
VN 
 
Documents 
The entries (the a’s) are the tf-idf 
weight of the terms in each 
document 
Kobayashi, Mol, & Kismihók - University of Amsterdam 17
tf-idf 
• There are many ways to assign weights to terms in the 
documents 
• The most popular is the tf-idf, computed by 
, , tf-idf tf idf t d t d t   
frequency of term t in document d inverse document frequency of term t 
idf log 
N 
t 
number of documents in the corpus where t 
occurs  
Kobayashi, Mol, & Kismihók - University of Amsterdam 18
a a 
  
11 1 
N 
  
  
 a a 
V 1 
VN 
 
Documents 
Vocabulary 
Apply Latent Dirichlet 
Allocation Model 
1. List of Topics 
2. Topic classification of 
documents 
Apply separately for each journal 
Kobayashi, Mol, & Kismihók - University of Amsterdam 19
Topical Trends 
• Topic for each document 
• Publication dates of documents 
• Create a chart depicting the evolution of topics from the 
publication dates and topics of the documents 
Kobayashi, Mol, & Kismihók - University of Amsterdam 20
Document Topic Publication Date 
Document 1 Topic 3 1990 
Document 2 Topic 5 1993 
… … … 
Document N Topic 12 1998 
Publication Date Topic 1 Topic T 
1975 Number of 
publications 
… Number of 
publications 
1976 Number of 
publications 
… Number of 
publications 
… … … … 
2014 Number of 
publications 
… Number of 
publications 
Kobayashi, Mol, & Kismihók - University of Amsterdam 21
Kobayashi, Mol, & Kismihók - University of Amsterdam 22
Kobayashi, Mol, & Kismihók - University of Amsterdam 23
Kobayashi, Mol, & Kismihók - University of Amsterdam 24
Kobayashi, Mol, & Kismihók - University of Amsterdam 25
Kobayashi, Mol, & Kismihók - University of Amsterdam 26
Conclusion 
Demonstrated the use of text mining to this type of application 
Idea of what is keeping the researchers of W&O psychology 
busy 
Offers a view of how W&O Psychology topics evolve and gain 
attention (which might reflect the development and maturation 
of the field) 
Can be alternative to traditional content analysis 
Facilitate peer review process by suggesting to researchers the 
outlet that will most likely accept their work. 
Kobayashi, Mol, & Kismihók - University of Amsterdam 27
Future Work 
• Aside from extracting topics one can also extract concepts, 
techniques, and key issues 
• Create a hierarchy of topics 
• Consider other parts of the document and not just the abstract. 
Kobayashi, Mol, & Kismihók - University of Amsterdam 28
MAIN REFERENCES 
• Learning Topic Models by Arora, Ge, and Moitra (2012) 
• Text Mining Infrastructure in R by Feinerer, Hornik, and Meyer 
(2008) 
• Understanding Evolution of Research Themes by Wang, Zhai, 
and Roth (2013) 
Kobayashi, Mol, & Kismihók - University of Amsterdam 29
ACKNOWLEDGEMENT 
• We would like to thank our colleague Ms Sofija Pajic for helping 
us out in interpreting the topics. 
Kobayashi, Mol, & Kismihók - University of Amsterdam 30

More Related Content

Viewers also liked

Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...
Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...
Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...Eduworks Network
 
Eduworks kick-off presentation: CEU
Eduworks kick-off presentation: CEUEduworks kick-off presentation: CEU
Eduworks kick-off presentation: CEUEduworks Network
 
Eduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks Network
 
Aias newsletter autumn 2014
Aias newsletter autumn 2014Aias newsletter autumn 2014
Aias newsletter autumn 2014Eduworks Network
 
Summary of Eduworks project
Summary of Eduworks projectSummary of Eduworks project
Summary of Eduworks projectEduworks Network
 
9th dutch flemish meeting presentation serlie mol
9th dutch flemish meeting presentation serlie mol9th dutch flemish meeting presentation serlie mol
9th dutch flemish meeting presentation serlie molEduworks Network
 
EAWOP SGM presentation slides
EAWOP SGM presentation slidesEAWOP SGM presentation slides
EAWOP SGM presentation slidesEduworks Network
 
Eduworks kick-off presentation: U-SIEGEN
Eduworks kick-off presentation: U-SIEGENEduworks kick-off presentation: U-SIEGEN
Eduworks kick-off presentation: U-SIEGENEduworks Network
 
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014Eduworks Network
 
Eduworks summer school 2014, detailed programme
Eduworks summer school 2014, detailed programmeEduworks summer school 2014, detailed programme
Eduworks summer school 2014, detailed programmeEduworks Network
 
Barriers to Adoption for Learning Analytics at a Dutch University
Barriers to Adoption for Learning Analytics at a Dutch UniversityBarriers to Adoption for Learning Analytics at a Dutch University
Barriers to Adoption for Learning Analytics at a Dutch UniversityEduworks Network
 
Eduworks presentation at Textkernel 17-01-2014
Eduworks presentation at Textkernel 17-01-2014Eduworks presentation at Textkernel 17-01-2014
Eduworks presentation at Textkernel 17-01-2014Eduworks Network
 
What have you learnt about technologies from the process of condtructing this...
What have you learnt about technologies from the process of condtructing this...What have you learnt about technologies from the process of condtructing this...
What have you learnt about technologies from the process of condtructing this...08cornicktho
 
Eduworks kick-off presentation: Corvinno
Eduworks kick-off presentation: CorvinnoEduworks kick-off presentation: Corvinno
Eduworks kick-off presentation: CorvinnoEduworks Network
 
Ke hoach truyen thong xi mang hoang long - Hà Quốc Nam
Ke hoach truyen thong xi mang hoang long - Hà Quốc NamKe hoach truyen thong xi mang hoang long - Hà Quốc Nam
Ke hoach truyen thong xi mang hoang long - Hà Quốc NamQuoc Nam
 

Viewers also liked (18)

Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...
Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...
Extending Computerized Adaptive Testing to Multiple Objectives: Envisioned on...
 
Ingrid 2014 stefan mol
Ingrid 2014 stefan molIngrid 2014 stefan mol
Ingrid 2014 stefan mol
 
Eduworks kick-off presentation: CEU
Eduworks kick-off presentation: CEUEduworks kick-off presentation: CEU
Eduworks kick-off presentation: CEU
 
Eduworks kick-off presentation: USAL
Eduworks kick-off presentation: USALEduworks kick-off presentation: USAL
Eduworks kick-off presentation: USAL
 
Aias newsletter autumn 2014
Aias newsletter autumn 2014Aias newsletter autumn 2014
Aias newsletter autumn 2014
 
Summary of Eduworks project
Summary of Eduworks projectSummary of Eduworks project
Summary of Eduworks project
 
9th dutch flemish meeting presentation serlie mol
9th dutch flemish meeting presentation serlie mol9th dutch flemish meeting presentation serlie mol
9th dutch flemish meeting presentation serlie mol
 
EAWOP SGM presentation slides
EAWOP SGM presentation slidesEAWOP SGM presentation slides
EAWOP SGM presentation slides
 
Eduworks kick-off presentation: U-SIEGEN
Eduworks kick-off presentation: U-SIEGENEduworks kick-off presentation: U-SIEGEN
Eduworks kick-off presentation: U-SIEGEN
 
JIA & RMB Manchester
JIA & RMB ManchesterJIA & RMB Manchester
JIA & RMB Manchester
 
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014
H Berkers & V.Kobayashi: Small talk ORM paper 29 9-2014
 
Eduworks summer school 2014, detailed programme
Eduworks summer school 2014, detailed programmeEduworks summer school 2014, detailed programme
Eduworks summer school 2014, detailed programme
 
Barriers to Adoption for Learning Analytics at a Dutch University
Barriers to Adoption for Learning Analytics at a Dutch UniversityBarriers to Adoption for Learning Analytics at a Dutch University
Barriers to Adoption for Learning Analytics at a Dutch University
 
Eduworks presentation at Textkernel 17-01-2014
Eduworks presentation at Textkernel 17-01-2014Eduworks presentation at Textkernel 17-01-2014
Eduworks presentation at Textkernel 17-01-2014
 
What have you learnt about technologies from the process of condtructing this...
What have you learnt about technologies from the process of condtructing this...What have you learnt about technologies from the process of condtructing this...
What have you learnt about technologies from the process of condtructing this...
 
Eduworks kick-off presentation: Corvinno
Eduworks kick-off presentation: CorvinnoEduworks kick-off presentation: Corvinno
Eduworks kick-off presentation: Corvinno
 
Aiec & csr presentation
Aiec & csr presentationAiec & csr presentation
Aiec & csr presentation
 
Ke hoach truyen thong xi mang hoang long - Hà Quốc Nam
Ke hoach truyen thong xi mang hoang long - Hà Quốc NamKe hoach truyen thong xi mang hoang long - Hà Quốc Nam
Ke hoach truyen thong xi mang hoang long - Hà Quốc Nam
 

Similar to Discovering Emerging Research Topics in Work and Organizational Psychology

Tactics on Research and Advanced Optimization Techniques in Engineering Appl...
Tactics on Research and Advanced Optimization Techniques  in Engineering Appl...Tactics on Research and Advanced Optimization Techniques  in Engineering Appl...
Tactics on Research and Advanced Optimization Techniques in Engineering Appl...Ajay Kumar
 
Patterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondPatterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondJonathan Bowen
 
Lecture 6 - Literature Review.pptx
Lecture 6 - Literature Review.pptxLecture 6 - Literature Review.pptx
Lecture 6 - Literature Review.pptxHafeezUllah783173
 
Lecture 9.28.10
Lecture 9.28.10Lecture 9.28.10
Lecture 9.28.10VMRoberts
 
Research (Technical Report Writing)
Research (Technical Report Writing) Research (Technical Report Writing)
Research (Technical Report Writing) Raj Kumar
 
Scientific writing as a means of communication
Scientific writing as a means of communicationScientific writing as a means of communication
Scientific writing as a means of communicationjilu123
 
02 Literature search and reviewing_1.pptx
02  Literature search and reviewing_1.pptx02  Literature search and reviewing_1.pptx
02 Literature search and reviewing_1.pptxssusere05ec21
 
Print source literature 24 March 2023.pptx
Print source literature 24 March 2023.pptxPrint source literature 24 March 2023.pptx
Print source literature 24 March 2023.pptxsanjaychavan62
 
Portland Place School June 2018
Portland Place School June 2018 Portland Place School June 2018
Portland Place School June 2018 EISLibrarian
 
Seminar Course instruction .ppt
Seminar Course instruction .pptSeminar Course instruction .ppt
Seminar Course instruction .pptMeymunaMohammed1
 
Iatefl 2019 How to be successful in discovering and publishing research
Iatefl 2019 How to be successful in discovering and publishing researchIatefl 2019 How to be successful in discovering and publishing research
Iatefl 2019 How to be successful in discovering and publishing researchCaroline Moore
 
UG dissertation workshop 2013
UG dissertation workshop 2013UG dissertation workshop 2013
UG dissertation workshop 2013Carolina Matos
 
Research Methodology-Introduction to Research 24.06.20.ppt
Research Methodology-Introduction to Research 24.06.20.pptResearch Methodology-Introduction to Research 24.06.20.ppt
Research Methodology-Introduction to Research 24.06.20.pptPritiJadhao5
 
Math history r
Math history rMath history r
Math history rJane Wu
 
Math history r
Math history rMath history r
Math history rJane Wu
 
Literature Review(For Young Researchers)
Literature Review(For Young Researchers)Literature Review(For Young Researchers)
Literature Review(For Young Researchers)DrAmitPurushottam
 

Similar to Discovering Emerging Research Topics in Work and Organizational Psychology (20)

Tactics on Research and Advanced Optimization Techniques in Engineering Appl...
Tactics on Research and Advanced Optimization Techniques  in Engineering Appl...Tactics on Research and Advanced Optimization Techniques  in Engineering Appl...
Tactics on Research and Advanced Optimization Techniques in Engineering Appl...
 
08. EDT 513 2023 Week 8.pptx
08. EDT 513 2023 Week 8.pptx08. EDT 513 2023 Week 8.pptx
08. EDT 513 2023 Week 8.pptx
 
Patterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondPatterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyond
 
THESIS OUTLINE.pptx
THESIS OUTLINE.pptxTHESIS OUTLINE.pptx
THESIS OUTLINE.pptx
 
Lecture 6 - Literature Review.pptx
Lecture 6 - Literature Review.pptxLecture 6 - Literature Review.pptx
Lecture 6 - Literature Review.pptx
 
Lecture 9.28.10
Lecture 9.28.10Lecture 9.28.10
Lecture 9.28.10
 
Structure of a research paper
Structure of a research paperStructure of a research paper
Structure of a research paper
 
Research (Technical Report Writing)
Research (Technical Report Writing) Research (Technical Report Writing)
Research (Technical Report Writing)
 
Scientific writing as a means of communication
Scientific writing as a means of communicationScientific writing as a means of communication
Scientific writing as a means of communication
 
02 Literature search and reviewing_1.pptx
02  Literature search and reviewing_1.pptx02  Literature search and reviewing_1.pptx
02 Literature search and reviewing_1.pptx
 
Print source literature 24 March 2023.pptx
Print source literature 24 March 2023.pptxPrint source literature 24 March 2023.pptx
Print source literature 24 March 2023.pptx
 
Portland Place School June 2018
Portland Place School June 2018 Portland Place School June 2018
Portland Place School June 2018
 
Seminar Course instruction .ppt
Seminar Course instruction .pptSeminar Course instruction .ppt
Seminar Course instruction .ppt
 
Iatefl 2019 How to be successful in discovering and publishing research
Iatefl 2019 How to be successful in discovering and publishing researchIatefl 2019 How to be successful in discovering and publishing research
Iatefl 2019 How to be successful in discovering and publishing research
 
UG dissertation workshop 2013
UG dissertation workshop 2013UG dissertation workshop 2013
UG dissertation workshop 2013
 
Research Methodology-Introduction to Research 24.06.20.ppt
Research Methodology-Introduction to Research 24.06.20.pptResearch Methodology-Introduction to Research 24.06.20.ppt
Research Methodology-Introduction to Research 24.06.20.ppt
 
Selection of scholarly journal
Selection of scholarly journal Selection of scholarly journal
Selection of scholarly journal
 
Math history r
Math history rMath history r
Math history r
 
Math history r
Math history rMath history r
Math history r
 
Literature Review(For Young Researchers)
Literature Review(For Young Researchers)Literature Review(For Young Researchers)
Literature Review(For Young Researchers)
 

More from Eduworks Network

Industrial Relations and Inequality in the Spanish Labour Market
Industrial Relations and Inequality in the Spanish Labour MarketIndustrial Relations and Inequality in the Spanish Labour Market
Industrial Relations and Inequality in the Spanish Labour MarketEduworks Network
 
Learning analytics the good the bad & the ugly
Learning analytics the good the bad & the uglyLearning analytics the good the bad & the ugly
Learning analytics the good the bad & the uglyEduworks Network
 
Dutch flemish 2013 presentation
Dutch flemish 2013 presentationDutch flemish 2013 presentation
Dutch flemish 2013 presentationEduworks Network
 
Labour Market Driven Learning Analytics
Labour Market Driven Learning AnalyticsLabour Market Driven Learning Analytics
Labour Market Driven Learning AnalyticsEduworks Network
 
Eduworks kick-off presentation: TCD
Eduworks kick-off presentation: TCDEduworks kick-off presentation: TCD
Eduworks kick-off presentation: TCDEduworks Network
 
Eduworks kick-off presentation: UvA - ABS
Eduworks kick-off presentation: UvA - ABSEduworks kick-off presentation: UvA - ABS
Eduworks kick-off presentation: UvA - ABSEduworks Network
 
Eduworks kick-off presentation: UvA AIAS
Eduworks kick-off presentation: UvA AIASEduworks kick-off presentation: UvA AIAS
Eduworks kick-off presentation: UvA AIASEduworks Network
 

More from Eduworks Network (14)

Janine berg
Janine bergJanine berg
Janine berg
 
Vladimer Kobayashi
Vladimer KobayashiVladimer Kobayashi
Vladimer Kobayashi
 
Alan Berg
Alan Berg Alan Berg
Alan Berg
 
Brian Fabo
Brian FaboBrian Fabo
Brian Fabo
 
Pablo de Pedraza
Pablo de PedrazaPablo de Pedraza
Pablo de Pedraza
 
Sisay Chala
Sisay ChalaSisay Chala
Sisay Chala
 
Industrial Relations and Inequality in the Spanish Labour Market
Industrial Relations and Inequality in the Spanish Labour MarketIndustrial Relations and Inequality in the Spanish Labour Market
Industrial Relations and Inequality in the Spanish Labour Market
 
Learning analytics the good the bad & the ugly
Learning analytics the good the bad & the uglyLearning analytics the good the bad & the ugly
Learning analytics the good the bad & the ugly
 
Dutch flemish 2013 presentation
Dutch flemish 2013 presentationDutch flemish 2013 presentation
Dutch flemish 2013 presentation
 
Labour Market Driven Learning Analytics
Labour Market Driven Learning AnalyticsLabour Market Driven Learning Analytics
Labour Market Driven Learning Analytics
 
JIA Cardiff
JIA CardiffJIA Cardiff
JIA Cardiff
 
Eduworks kick-off presentation: TCD
Eduworks kick-off presentation: TCDEduworks kick-off presentation: TCD
Eduworks kick-off presentation: TCD
 
Eduworks kick-off presentation: UvA - ABS
Eduworks kick-off presentation: UvA - ABSEduworks kick-off presentation: UvA - ABS
Eduworks kick-off presentation: UvA - ABS
 
Eduworks kick-off presentation: UvA AIAS
Eduworks kick-off presentation: UvA AIASEduworks kick-off presentation: UvA AIAS
Eduworks kick-off presentation: UvA AIAS
 

Recently uploaded

BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 

Recently uploaded (20)

BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 

Discovering Emerging Research Topics in Work and Organizational Psychology

  • 1. eduworks-network.eu facebook.com/eduworksnetwork @EduworksNetwork Discovering Emerging Research Topic and Trends in W&O Psychology by Text Mining Scientific Articles Vladimer Kobayashi, Stefan T Mol, Ph.D., Gábor Kismihók, Ph.D.
  • 2. Contents • Background – Why this study? Hasn’t this been done before? • Objectives – What are we really trying to do here? • Materials – The ingredients • Methods – The tools • Results – Show me the outcome! • Conclusion and Future Work – What has been achieved? How to proceed Kobayashi, Mol, & Kismihók - University of Amsterdam 2
  • 3. Kobayashi, Mol, & Kismihók - University of Amsterdam 3
  • 4. Background • The psychological literature is huge (PsychINFO abstracts 3.7 million documents and PubPsych has 900,000 searchable records) • Text Mining applications • Mining biomedical literature • Web textual data • Opinion and Sentiment mining from product reviews, microblogging, users’ posts and comments. • Text Mining opportunities for gaining insight into trends in the scientific literature • Key term extraction to support efficient document search and retrieval • Identifying topics to group document with similar themes • So far little text mining effort has been made in the W&O psychology Literature Kobayashi, Mol, & Kismihók - University of Amsterdam 4
  • 5. Kobayashi, Mol, & Kismihók - University of Amsterdam 5
  • 6. Objectives • Apply text mining, specifically, topic modeling techniques to the W&O literature • Pair topics and publication dates to reveal topical trends in this field Contributions • Efficient search and retrieval of W&O psychology literature • Supporting systematic literature review and automatic knowledge discovery • Identifying topics (or themes) and topic trends Kobayashi, Mol, & Kismihók - University of Amsterdam 6
  • 7. Kobayashi, Mol, & Kismihók - University of Amsterdam 7
  • 8. Terminology • Document – a file that contains sequence of characters or text • Corpus – collection of documents • Term – smallest unit in a document (e.g. word, phrase, sentence, or even a single character) • Vocabulary or lexicon – set of all unique terms Kobayashi, Mol, & Kismihók - University of Amsterdam 8
  • 9. Kobayashi, Mol, & Kismihók - University of Amsterdam 9
  • 10. SOURCE • Abstracts from 4 journals 1975-2014 1096 abstracts 2008-2014 89 abstracts 1977-2014 1115 abstracts 1991-2014 602 abstracts Total number of abstracts: 2902 Kobayashi, Mol, & Kismihók - University of Amsterdam 10
  • 11. For this study… • DOCUMENT • A single abstract • CORPUS • Collection of abstracts • TERMS • Words • VOCABULARY • Set of all unique words (after preprocessing) in the corpus Kobayashi, Mol, & Kismihók - University of Amsterdam 11
  • 12. Why Abstracts only? • The abstract contains the gist of the whole article • Commonly, articles are indexed based on titles, keywords and abstracts. Kobayashi, Mol, & Kismihók - University of Amsterdam 12
  • 13. Kobayashi, Mol, & Kismihók - University of Amsterdam 13
  • 14. Techniques • String Processing • Natural Language Processing • Topic Modeling • Latent Dirichlet Allocation Model • Assumes that each document is a mixture of topics • Each word is generated from a specific topic • An algorithm for topic discovery • Topical Trend Analysis Kobayashi, Mol, & Kismihók - University of Amsterdam 14
  • 15. Analysis done separately for each journal Kobayashi, Mol, & Kismihók - University of Amsterdam 15
  • 16. Original abstract Preprocessed abstract  Lower case transformation  Stopwords removal  Delete punctuations  Stemming Kobayashi, Mol, & Kismihók - University of Amsterdam 16
  • 17. Abstracts Vocabulary The document-by-term matrix a a   11 1 N      a a V 1 VN  Documents The entries (the a’s) are the tf-idf weight of the terms in each document Kobayashi, Mol, & Kismihók - University of Amsterdam 17
  • 18. tf-idf • There are many ways to assign weights to terms in the documents • The most popular is the tf-idf, computed by , , tf-idf tf idf t d t d t   frequency of term t in document d inverse document frequency of term t idf log N t number of documents in the corpus where t occurs  Kobayashi, Mol, & Kismihók - University of Amsterdam 18
  • 19. a a   11 1 N      a a V 1 VN  Documents Vocabulary Apply Latent Dirichlet Allocation Model 1. List of Topics 2. Topic classification of documents Apply separately for each journal Kobayashi, Mol, & Kismihók - University of Amsterdam 19
  • 20. Topical Trends • Topic for each document • Publication dates of documents • Create a chart depicting the evolution of topics from the publication dates and topics of the documents Kobayashi, Mol, & Kismihók - University of Amsterdam 20
  • 21. Document Topic Publication Date Document 1 Topic 3 1990 Document 2 Topic 5 1993 … … … Document N Topic 12 1998 Publication Date Topic 1 Topic T 1975 Number of publications … Number of publications 1976 Number of publications … Number of publications … … … … 2014 Number of publications … Number of publications Kobayashi, Mol, & Kismihók - University of Amsterdam 21
  • 22. Kobayashi, Mol, & Kismihók - University of Amsterdam 22
  • 23. Kobayashi, Mol, & Kismihók - University of Amsterdam 23
  • 24. Kobayashi, Mol, & Kismihók - University of Amsterdam 24
  • 25. Kobayashi, Mol, & Kismihók - University of Amsterdam 25
  • 26. Kobayashi, Mol, & Kismihók - University of Amsterdam 26
  • 27. Conclusion Demonstrated the use of text mining to this type of application Idea of what is keeping the researchers of W&O psychology busy Offers a view of how W&O Psychology topics evolve and gain attention (which might reflect the development and maturation of the field) Can be alternative to traditional content analysis Facilitate peer review process by suggesting to researchers the outlet that will most likely accept their work. Kobayashi, Mol, & Kismihók - University of Amsterdam 27
  • 28. Future Work • Aside from extracting topics one can also extract concepts, techniques, and key issues • Create a hierarchy of topics • Consider other parts of the document and not just the abstract. Kobayashi, Mol, & Kismihók - University of Amsterdam 28
  • 29. MAIN REFERENCES • Learning Topic Models by Arora, Ge, and Moitra (2012) • Text Mining Infrastructure in R by Feinerer, Hornik, and Meyer (2008) • Understanding Evolution of Research Themes by Wang, Zhai, and Roth (2013) Kobayashi, Mol, & Kismihók - University of Amsterdam 29
  • 30. ACKNOWLEDGEMENT • We would like to thank our colleague Ms Sofija Pajic for helping us out in interpreting the topics. Kobayashi, Mol, & Kismihók - University of Amsterdam 30

Editor's Notes

  1. Please (and always) write out all first names in full
  2. Although these are the usual suspects for a contents section, it may be good to replace these with titles that provide some more detail as to the exact content within each section (I used to have content sections like these, and presenting them is somewhat boring)
  3. I would probably not dedicate a full slide to this.
  4. Replace “Immense number of psychological literature ” with “The psychological literature is huge” Replace “Text Mining opportunities for scientific literature” with “Text Mining opportunities for gaining insight into trends in the scientific literature” Replace “Identify” with “Identifying” Replace “has been done to” with “has been made in the”
  5. I would probably not dedicate a full slide to this
  6. Replace “on the” with “to the” Replace “indentify” with “identifying”
  7. I would probably not dedicate a full slide to this
  8. Replace “tekst” with “text”
  9. I would probably not dedicate a full slide to this
  10. Replace “abstract” with “abstracts”
  11. I would probably not dedicate a full slide to this
  12. I would probably not dedicate a full slide to this
  13. So after Lower case transformation, Stopwords removal, Delete punctuations, and Stemming, how are these topics regenerated? Here I see punctuation for instance. To empower has little to do with team effectiveness Maybe we should quickly discuss this slide (the topic interpretation part). In my understanding it would be akin to the naming of factors in an exploratory factor analysis?
  14. Maybe use journal logo’s here instead of the yellow acronym? Organizational behavior is at a different level of abstraction than bullying and harassment (in fact you may say that the former contains the latter)
  15. Insert journal logo? It is not directly clear to me what impact assessment is.
  16. I would probably not dedicate a full slide to this
  17. I would drop the word successfully. Leave this to the audience to judge. The numbers imply an order that is not really there. I would suggest dropping these.
  18. For the hierarchy of topics you may use the organizational behavior example