SlideShare a Scribd company logo
1 of 8
Download to read offline
7/28/2015
1
J. A. Mazanec, http://raptor.mazanec.com:3000 1
IFITT PhD Seminar 2015
Text Mining Ideas & Examples
J. A. Mazanec
Modul University Vienna
J. A. Mazanec, https://raptor.mazanec.com 2
Contents
Purpose of the introductory presentation
Analyzing the projected images of
destinations
Classifying customer reviews via significant
word items
Extracting topics from positive versus
negative reviews
7/28/2015
2
J. A. Mazanec, https://raptor.mazanec.com 3
Destinations under similarity
competition
 Theoretical underpinnings
 Choosing attributes: affective image
 Connotations
 Survey data vs. Internet sources
 Retrieving co-occurrences
 (dis)similarity – Normalized Google Distance
 Visualizing with …
Dendrograms and
Image maps
J. A. Mazanec, https://raptor.mazanec.com 4
Wayne Chase‘s system of
emotionally positive connotations
Adoration Amazement Admiration Appreciation Affection
Amusement Bliss Amorousness Astonishment Cheer
Comfort Devotion Eagerness Delight Contentment
Fondness Enthusiasm Ecstasy Friendliness Excitement
Elation Gladness Infatuation Exhilaration Enjoyment
Gratitude Kindliness Exuberance Euphoria Hope
Liking Fun Exultation Peacefulness Love
Glee Happiness Lust Hilarity Joy
Relief Passion Merriment Jubilation Satisfaction
Tenderness Mirth Pleasure Serenity Trust
Surprise Pride Thankfulness Warmth Thrill
Rapture Wonder Well-being
7/28/2015
3
J. A. Mazanec, https://raptor.mazanec.com 5
Normalized Google Similarity
Distance (Cilibrasi & Vitanyi 2007)
LSL 
SA 
S
S
J. A. Mazanec, https://raptor.mazanec.com 6
Destination countries with similar
connotative environment
7/28/2015
4
J. A. Mazanec, https://raptor.mazanec.com 7
Destination countries in connotative
Google space
J. A. Mazanec, https://raptor.mazanec.com 8
Exercises
 In-class (group) work
Choose destinations and decide on the attributes
Retrieve from Internet with Google queries
Generate dendrograms and maps
Evaluate results
Comment on relative competitiveness
7/28/2015
5
J. A. Mazanec, https://raptor.mazanec.com 9
Classifying online customer reviews
 Objective: significant word items
 Underlying hypothesis
 Practical usage: identify symptomatic words
as early warning signal
 Analytical method: Penalized Support Vector
Machines
 Demo and exercising
J. A. Mazanec, https://raptor.mazanec.com 10
SVM-Support Vector Machine
(Meyer, 2012)
7/28/2015
6
J. A. Mazanec, https://raptor.mazanec.com 11
Extracting topics from positive &
negative online reviews
 Objective: explore customers‘ use of
language
 Underlying hypothesis
 Practical use: automatic doc annotation;
structure of customer language
 Analytical method: Latent Dirichlet Analysis
 Demo and exercising
LDA basics (Blei, 2012)
topic:= probability distribution over a fixed
vocabulary
distribution over topics
per-document distribution over topics
all documents share the same set of topics
topics, per-document topic distributions,
per-document per-word topic assignments =
hidden structure (that likely generated the
observed documents)
J. A. Mazanec, http://raptor.mazanec.com:3000 12
7/28/2015
7
Latent topics (Blei, 2012)
J. A. Mazanec, http://raptor.mazanec.com:3000 13
Graphical model for LDA (Blei, 2012)
J. A. Mazanec, http://raptor.mazanec.com:3000 14
compute the hidden topic structure ( = posterior distribution = conditional
distribution of the hidden variables given the documents)
β: topic distribution over words θ : proportion for topic k in document d
z: topic assignment for word n in doc d w: word n in document d
α, η
7/28/2015
8
The generative process and posterior
distribution
J. A. Mazanec, http://raptor.mazanec.com:3000 15
References
Becker, N., Werft, W., Toedt, G., Lichter, P. and Benner, A. (2009).
penalizedSVM: a R-package for feature selection SVM
classification. Bioinformatics 25(13): 1711–1712.
Blei, D. (2012). Probabilistic Topic Models. Communications of the
ACM 55(4): 77-84.
Grün, B. and K. Hornik (2011). topicmodels: An R Package for Fitting
Topic Models. Journal of Statistical Software 10(13): 1-30.
Mazanec, J. A. (2010). Tourism-Receiving Countries in Connotative
Google Space. Journal of Travel Research 49 (Nov): 501-512.
Meyer, D. (2011). Support Vector Machines: The Interface to libsvm
in Package e1071. University of Technology, Vienna.
J. A. Mazanec, http://raptor.mazanec.com:3000 16

More Related Content

What's hot

Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016Kunal Dash
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningQuantUniversity
 
Research Method EMBA chapter 11
Research Method EMBA chapter 11Research Method EMBA chapter 11
Research Method EMBA chapter 11Mazhar Poohlah
 
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataEvangelia Daskalaki
 
Research Method EMBA chapter 12
Research Method EMBA chapter 12Research Method EMBA chapter 12
Research Method EMBA chapter 12Mazhar Poohlah
 
Classification Scoring for Cleaning Inconsistent Survey Data
Classification Scoring for Cleaning Inconsistent Survey DataClassification Scoring for Cleaning Inconsistent Survey Data
Classification Scoring for Cleaning Inconsistent Survey DataCSCJournals
 
Carma internet research module n-bias
Carma internet research module   n-biasCarma internet research module   n-bias
Carma internet research module n-biasSyracuse University
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsTao Xie
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Shalin Hai-Jew
 
TensorFlow London: Progressive Growing of GANs for increased stability, quali...
TensorFlow London: Progressive Growing of GANs for increased stability, quali...TensorFlow London: Progressive Growing of GANs for increased stability, quali...
TensorFlow London: Progressive Growing of GANs for increased stability, quali...Seldon
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsBrowse Jobs
 

What's hot (14)

Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
 
Document
DocumentDocument
Document
 
Analytics
AnalyticsAnalytics
Analytics
 
Research Method EMBA chapter 11
Research Method EMBA chapter 11Research Method EMBA chapter 11
Research Method EMBA chapter 11
 
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
 
Research Method EMBA chapter 12
Research Method EMBA chapter 12Research Method EMBA chapter 12
Research Method EMBA chapter 12
 
Classification Scoring for Cleaning Inconsistent Survey Data
Classification Scoring for Cleaning Inconsistent Survey DataClassification Scoring for Cleaning Inconsistent Survey Data
Classification Scoring for Cleaning Inconsistent Survey Data
 
Carma internet research module n-bias
Carma internet research module   n-biasCarma internet research module   n-bias
Carma internet research module n-bias
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software Analytics
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
 
TensorFlow London: Progressive Growing of GANs for increased stability, quali...
TensorFlow London: Progressive Growing of GANs for increased stability, quali...TensorFlow London: Progressive Growing of GANs for increased stability, quali...
TensorFlow London: Progressive Growing of GANs for increased stability, quali...
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-Text
 
An empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systemsAn empirical performance evaluation of relational keyword search systems
An empirical performance evaluation of relational keyword search systems
 

Similar to IFITT PhD Seminar 2015. Text Mining Ideas & Examples

Aspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete pptAspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete ppttanvikadam76
 
WIRE:WIsdom-awaRE computing
WIRE:WIsdom-awaRE computingWIRE:WIsdom-awaRE computing
WIRE:WIsdom-awaRE computingwire unitn
 
Cmaps as intellectual prosthesis (GERAS 34, Paris)
Cmaps as intellectual prosthesis (GERAS 34, Paris)Cmaps as intellectual prosthesis (GERAS 34, Paris)
Cmaps as intellectual prosthesis (GERAS 34, Paris)Lawrie Hunter
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmIJSRD
 
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHSAUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHScscpconf
 
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHSAUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHScsandit
 
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollSemantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollCROKODIl consortium
 
IEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsIEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsVijay Karan
 
IEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsIEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsVijay Karan
 
Seven Quality management & control tools
Seven Quality management & control toolsSeven Quality management & control tools
Seven Quality management & control toolsPMC Mentor
 
Learning Relations from Social Tagging Data
Learning Relations from Social Tagging DataLearning Relations from Social Tagging Data
Learning Relations from Social Tagging DataHang Dong
 
Predicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov ModelsPredicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov ModelsJulia Kiseleva
 
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...Dragan Gasevic
 
Contextual Recommendation of Social Updates, a tag-based framework
Contextual Recommendation of Social Updates, a tag-based frameworkContextual Recommendation of Social Updates, a tag-based framework
Contextual Recommendation of Social Updates, a tag-based frameworkAdrien Joly
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataIswarya M
 

Similar to IFITT PhD Seminar 2015. Text Mining Ideas & Examples (20)

Using Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive ComputingUsing Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive Computing
 
SwethaReddyAllam_Resume
SwethaReddyAllam_ResumeSwethaReddyAllam_Resume
SwethaReddyAllam_Resume
 
Aspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete pptAspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete ppt
 
WIRE:WIsdom-awaRE computing
WIRE:WIsdom-awaRE computingWIRE:WIsdom-awaRE computing
WIRE:WIsdom-awaRE computing
 
Presentation MSE 2022
Presentation MSE 2022Presentation MSE 2022
Presentation MSE 2022
 
Cmaps as intellectual prosthesis (GERAS 34, Paris)
Cmaps as intellectual prosthesis (GERAS 34, Paris)Cmaps as intellectual prosthesis (GERAS 34, Paris)
Cmaps as intellectual prosthesis (GERAS 34, Paris)
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHSAUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
 
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHSAUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
AUTOMATED SHORT ANSWER GRADER USING FRIENDSHIP GRAPHS
 
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollSemantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
 
IEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsIEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# Projects
 
IEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# ProjectsIEEE 2014 ASP.NET with C# Projects
IEEE 2014 ASP.NET with C# Projects
 
Seven Quality management & control tools
Seven Quality management & control toolsSeven Quality management & control tools
Seven Quality management & control tools
 
Learning Relations from Social Tagging Data
Learning Relations from Social Tagging DataLearning Relations from Social Tagging Data
Learning Relations from Social Tagging Data
 
Collaborative DL
Collaborative DLCollaborative DL
Collaborative DL
 
Predicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov ModelsPredicting Current User Intent with Contextual Markov Models
Predicting Current User Intent with Contextual Markov Models
 
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...
Nurturing the Connections: The Role of Quantitative Ethnography in Learning A...
 
Contextual Recommendation of Social Updates, a tag-based framework
Contextual Recommendation of Social Updates, a tag-based frameworkContextual Recommendation of Social Updates, a tag-based framework
Contextual Recommendation of Social Updates, a tag-based framework
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
 
Mapping Keywords to
Mapping Keywords to Mapping Keywords to
Mapping Keywords to
 

Recently uploaded

Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 

Recently uploaded (20)

Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 

IFITT PhD Seminar 2015. Text Mining Ideas & Examples

  • 1. 7/28/2015 1 J. A. Mazanec, http://raptor.mazanec.com:3000 1 IFITT PhD Seminar 2015 Text Mining Ideas & Examples J. A. Mazanec Modul University Vienna J. A. Mazanec, https://raptor.mazanec.com 2 Contents Purpose of the introductory presentation Analyzing the projected images of destinations Classifying customer reviews via significant word items Extracting topics from positive versus negative reviews
  • 2. 7/28/2015 2 J. A. Mazanec, https://raptor.mazanec.com 3 Destinations under similarity competition  Theoretical underpinnings  Choosing attributes: affective image  Connotations  Survey data vs. Internet sources  Retrieving co-occurrences  (dis)similarity – Normalized Google Distance  Visualizing with … Dendrograms and Image maps J. A. Mazanec, https://raptor.mazanec.com 4 Wayne Chase‘s system of emotionally positive connotations Adoration Amazement Admiration Appreciation Affection Amusement Bliss Amorousness Astonishment Cheer Comfort Devotion Eagerness Delight Contentment Fondness Enthusiasm Ecstasy Friendliness Excitement Elation Gladness Infatuation Exhilaration Enjoyment Gratitude Kindliness Exuberance Euphoria Hope Liking Fun Exultation Peacefulness Love Glee Happiness Lust Hilarity Joy Relief Passion Merriment Jubilation Satisfaction Tenderness Mirth Pleasure Serenity Trust Surprise Pride Thankfulness Warmth Thrill Rapture Wonder Well-being
  • 3. 7/28/2015 3 J. A. Mazanec, https://raptor.mazanec.com 5 Normalized Google Similarity Distance (Cilibrasi & Vitanyi 2007) LSL  SA  S S J. A. Mazanec, https://raptor.mazanec.com 6 Destination countries with similar connotative environment
  • 4. 7/28/2015 4 J. A. Mazanec, https://raptor.mazanec.com 7 Destination countries in connotative Google space J. A. Mazanec, https://raptor.mazanec.com 8 Exercises  In-class (group) work Choose destinations and decide on the attributes Retrieve from Internet with Google queries Generate dendrograms and maps Evaluate results Comment on relative competitiveness
  • 5. 7/28/2015 5 J. A. Mazanec, https://raptor.mazanec.com 9 Classifying online customer reviews  Objective: significant word items  Underlying hypothesis  Practical usage: identify symptomatic words as early warning signal  Analytical method: Penalized Support Vector Machines  Demo and exercising J. A. Mazanec, https://raptor.mazanec.com 10 SVM-Support Vector Machine (Meyer, 2012)
  • 6. 7/28/2015 6 J. A. Mazanec, https://raptor.mazanec.com 11 Extracting topics from positive & negative online reviews  Objective: explore customers‘ use of language  Underlying hypothesis  Practical use: automatic doc annotation; structure of customer language  Analytical method: Latent Dirichlet Analysis  Demo and exercising LDA basics (Blei, 2012) topic:= probability distribution over a fixed vocabulary distribution over topics per-document distribution over topics all documents share the same set of topics topics, per-document topic distributions, per-document per-word topic assignments = hidden structure (that likely generated the observed documents) J. A. Mazanec, http://raptor.mazanec.com:3000 12
  • 7. 7/28/2015 7 Latent topics (Blei, 2012) J. A. Mazanec, http://raptor.mazanec.com:3000 13 Graphical model for LDA (Blei, 2012) J. A. Mazanec, http://raptor.mazanec.com:3000 14 compute the hidden topic structure ( = posterior distribution = conditional distribution of the hidden variables given the documents) β: topic distribution over words θ : proportion for topic k in document d z: topic assignment for word n in doc d w: word n in document d α, η
  • 8. 7/28/2015 8 The generative process and posterior distribution J. A. Mazanec, http://raptor.mazanec.com:3000 15 References Becker, N., Werft, W., Toedt, G., Lichter, P. and Benner, A. (2009). penalizedSVM: a R-package for feature selection SVM classification. Bioinformatics 25(13): 1711–1712. Blei, D. (2012). Probabilistic Topic Models. Communications of the ACM 55(4): 77-84. Grün, B. and K. Hornik (2011). topicmodels: An R Package for Fitting Topic Models. Journal of Statistical Software 10(13): 1-30. Mazanec, J. A. (2010). Tourism-Receiving Countries in Connotative Google Space. Journal of Travel Research 49 (Nov): 501-512. Meyer, D. (2011). Support Vector Machines: The Interface to libsvm in Package e1071. University of Technology, Vienna. J. A. Mazanec, http://raptor.mazanec.com:3000 16