SlideShare a Scribd company logo
1 of 21
ENTER 2018 Research Track Slide Number 1
Automatic Summarization of
Multiple Travel Blog Entries
Focusing on Travelers’ Behavior
Shumpei Iinuma
* Hidetugu Nanba
Toshiyuki Takezawa
Hiroshima City University, JAPAN
nanba@Hiroshima-cu.ac.jp
http://www.ls.info.Hiroshima-cu.ac.jp/~nanba
ENTER 2018 Research Track Slide Number 2
Purpose
• To generate a summary of multiple travel
blog entries.
• Our method identifies significant sentences
in addition to the images.
ENTER 2018 Research Track Slide Number 3
Demo
http://165.242.101.30/blogMap/
ENTER 2018 Research Track Slide Number 4
Related work
Travel Information Recommendation
•[Wu 08]: selects and shows medias for each
type of query (What is the historical
background of Tian Tan?)->history category
-> text information
•[Hao 10]: shows representative tags and
snippets for a given destination
We generate a text summary with images
ENTER 2018 Research Track Slide Number 5
Our Summarization System
Input: geographical region and content type
1.Cluster blog entries
2.Calculate the importance of each sentence
and image for each cluster
3.Select three to five important sentences
and images for each cluster
ENTER 2018 Research Track Slide Number 6
Identification of Content Type of
Each Blog Entry (Fujii+2016)
Content type Criterion
Watch Sightseeing for watching enjoyment
Experience Experience (scuba diving, dance)
Buy Shopping or souvenir stores
Dine Drinking and dining
Stay Accommodation
ENTER 2018 Research Track Slide Number 7
LexRank [Erkan 04]
PageRank-base text
summarization
ENTER 2018 Research Track Slide Number 8
x1 x2
x3
P1
P2
P3




















=










3
2
1
3
2
1
012/1
002/1
100
P
P
P
P
P
P
1321 =++ PPP










=










5/2
5/1
5/2
3
2
1
P
P
P
ENTER 2018 Research Track Slide Number 9
x1 x2
x3
P1
P2
P3
ENTER 2018 Research Track Slide Number 10






























−+










=










3
2
1
3
2
1
002/1
002/1
100
)1(
3/13/13/1
3/13/13/1
3/13/13/1
P
P
P
dd
P
P
P
1321 =++ PPP
ENTER 2018 Research Track Slide Number 11
LexRank [Erkan 04]
• Make a graph by
connecting sentences,
whose similarity scores are
higher than a threshold
value
• Then, apply PageRank
algorithm to this graph,
and select important
sentences.
ENTER 2018 Research Track Slide Number 12
• Make a graph by
connecting images,
whose similarity scores
are higher than a
threshold value
• Then, apply PageRank
algorithm to this graph,
and select important
images.
ENTER 2018 Research Track Slide Number 13
• Connect both text
and image graphs
• Then, apply PageRank
algorithm to this
graph, and select
important sentences
and images.
ENTER 2018 Research Track Slide Number 14
Similarity between items
• Similarity between sentences:
tfidf
Cosine distance
• Similarity between images:
Color histogram (HSV)
Bag of Visual Words: SIFT
Cosine distance
ENTER 2018 Research Track Slide Number 15
Summarization taking account of
content types
• Watch: view, beautiful, park…
• Dine: tasty, noodle…
ENTER 2018 Research Track Slide Number 16






























−+










=










3
2
1
3
2
1
002/1
002/1
100
)1(
3/13/13/1
3/13/13/1
3/13/13/1
P
P
P
dd
P
P
P
1321 =++ PPP
ENTER 2018 Research Track Slide Number 17
• a
A sentence that have a strong
relationship with a given
content type
ENTER 2018 Research Track Slide Number 18
Experiments
Data
•Manually created summaries for 20 spots.
Evaluation measure
•ROUGE-N
•Ranking (MANUAL-TEXT, MANUAL-IMAGE-TEXT)
Alternatives
•Lead (baserline)
•LexRank (baseline)
•LR+IMG
•LR+IMG+TYPE
•LR+IMG+TYPE
ENTER 2018 Research Track Slide Number 19
Evaluation by ROUGE-N
(automatic evaluation)
ROUGE-1 ROUGE-2
LexRank (baseline) 0.316 0.207
IR+IMG 0.331 0.227
LR+TYPE 0.345 0.240
IR+IMG+TYPE 0.340 0.237
ENTER 2018 Research Track Slide Number 20
Manual Evaluation
MANUAL-TEXT
Human-produced 1.28
Lead (baseline) 4.01
LexRank (baseline) 3.09
LR+IMG 2.85
LR+TYPE 3.22
IR+IMG+TYPE 2.99
ENTER 2018 Research Track Slide Number 24
Conclusions
• We propose a method of summarizing multiple
travel blog entries.
• By connecting a text and an image graph->
content+image was improved
• By taking account of content type -> more accurate
content type-biased summary
• We also constructed a summarization system
• http://www.ls.info.Hiroshima-cu.ac.jp/blogMap/

More Related Content

Similar to Automatic Summarization of Multiple Travel Blog Entries Focusing on Travelers’ Behavior

BAEB601 Chapter 4 : Findings, Analysis and SPSS
BAEB601 Chapter 4 : Findings, Analysis and SPSSBAEB601 Chapter 4 : Findings, Analysis and SPSS
BAEB601 Chapter 4 : Findings, Analysis and SPSSDr Nur Suhaili Ramli
 
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Simple Yet Effective Methods for Large-Scale Scholarly Publication RankingSimple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Simple Yet Effective Methods for Large-Scale Scholarly Publication RankingDasha Herrmannova
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxrafaelaj1
 
Course-Adaptive Content Recommender for Course Authoring
Course-Adaptive Content Recommender for Course AuthoringCourse-Adaptive Content Recommender for Course Authoring
Course-Adaptive Content Recommender for Course AuthoringPeter Brusilovsky
 
Multi-level game learning analytics for serious games - VSGames 2018
Multi-level game learning analytics for serious games - VSGames 2018Multi-level game learning analytics for serious games - VSGames 2018
Multi-level game learning analytics for serious games - VSGames 2018Iván Pérez Colado
 
EP-Course Overview-syllabus-R3.pdf
EP-Course Overview-syllabus-R3.pdfEP-Course Overview-syllabus-R3.pdf
EP-Course Overview-syllabus-R3.pdfKwangheeJung
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsMatthias Braunhofer
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationEnrico Palumbo
 
A323568347_29695_5_2024_Lecture0_INT232.ppt
A323568347_29695_5_2024_Lecture0_INT232.pptA323568347_29695_5_2024_Lecture0_INT232.ppt
A323568347_29695_5_2024_Lecture0_INT232.pptCheriviralaNikhil
 
Fyp proposal writing
Fyp proposal writingFyp proposal writing
Fyp proposal writingAsyikin Rahim
 
Dept.Statistics-IPB-University-PPT-Guidelines.pptx
Dept.Statistics-IPB-University-PPT-Guidelines.pptxDept.Statistics-IPB-University-PPT-Guidelines.pptx
Dept.Statistics-IPB-University-PPT-Guidelines.pptxKristaSatyaNugraha
 
Geometric and Semantic Matching for Cultural Heritage Artefacts
Geometric and Semantic Matching for Cultural Heritage ArtefactsGeometric and Semantic Matching for Cultural Heritage Artefacts
Geometric and Semantic Matching for Cultural Heritage ArtefactsGravitate Project
 
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...KnowledgeGraph
 
Graph processing at scale using spark & graph frames
Graph processing at scale using spark & graph framesGraph processing at scale using spark & graph frames
Graph processing at scale using spark & graph framesRon Barabash
 

Similar to Automatic Summarization of Multiple Travel Blog Entries Focusing on Travelers’ Behavior (20)

Defining Tourism Domains for Semantic Annotation of Web Content (Research Note)
Defining Tourism Domains for Semantic Annotation of Web Content (Research Note)Defining Tourism Domains for Semantic Annotation of Web Content (Research Note)
Defining Tourism Domains for Semantic Annotation of Web Content (Research Note)
 
BAEB601 Chapter 4 : Findings, Analysis and SPSS
BAEB601 Chapter 4 : Findings, Analysis and SPSSBAEB601 Chapter 4 : Findings, Analysis and SPSS
BAEB601 Chapter 4 : Findings, Analysis and SPSS
 
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Simple Yet Effective Methods for Large-Scale Scholarly Publication RankingSimple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
Simple Yet Effective Methods for Large-Scale Scholarly Publication Ranking
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
 
Week_3_Lecture.pdf
Week_3_Lecture.pdfWeek_3_Lecture.pdf
Week_3_Lecture.pdf
 
Course-Adaptive Content Recommender for Course Authoring
Course-Adaptive Content Recommender for Course AuthoringCourse-Adaptive Content Recommender for Course Authoring
Course-Adaptive Content Recommender for Course Authoring
 
Multi-level game learning analytics for serious games - VSGames 2018
Multi-level game learning analytics for serious games - VSGames 2018Multi-level game learning analytics for serious games - VSGames 2018
Multi-level game learning analytics for serious games - VSGames 2018
 
EP-Course Overview-syllabus-R3.pdf
EP-Course Overview-syllabus-R3.pdfEP-Course Overview-syllabus-R3.pdf
EP-Course Overview-syllabus-R3.pdf
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender Systems
 
Contextual information elicitation in travel recommender systems
Contextual information elicitation in travel recommender systemsContextual information elicitation in travel recommender systems
Contextual information elicitation in travel recommender systems
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
 
Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...Key factors in the booking activity process: the case of self-catering in Rom...
Key factors in the booking activity process: the case of self-catering in Rom...
 
A323568347_29695_5_2024_Lecture0_INT232.ppt
A323568347_29695_5_2024_Lecture0_INT232.pptA323568347_29695_5_2024_Lecture0_INT232.ppt
A323568347_29695_5_2024_Lecture0_INT232.ppt
 
Fyp proposal writing
Fyp proposal writingFyp proposal writing
Fyp proposal writing
 
Slides ecir2016
Slides ecir2016Slides ecir2016
Slides ecir2016
 
Dept.Statistics-IPB-University-PPT-Guidelines.pptx
Dept.Statistics-IPB-University-PPT-Guidelines.pptxDept.Statistics-IPB-University-PPT-Guidelines.pptx
Dept.Statistics-IPB-University-PPT-Guidelines.pptx
 
Geometric and Semantic Matching for Cultural Heritage Artefacts
Geometric and Semantic Matching for Cultural Heritage ArtefactsGeometric and Semantic Matching for Cultural Heritage Artefacts
Geometric and Semantic Matching for Cultural Heritage Artefacts
 
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
 
Graph processing at scale using spark & graph frames
Graph processing at scale using spark & graph framesGraph processing at scale using spark & graph frames
Graph processing at scale using spark & graph frames
 
2015 03-28-eb-final
2015 03-28-eb-final2015 03-28-eb-final
2015 03-28-eb-final
 

Recently uploaded

Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...caitlingebhard1
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 

Recently uploaded (20)

Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 

Automatic Summarization of Multiple Travel Blog Entries Focusing on Travelers’ Behavior

  • 1. ENTER 2018 Research Track Slide Number 1 Automatic Summarization of Multiple Travel Blog Entries Focusing on Travelers’ Behavior Shumpei Iinuma * Hidetugu Nanba Toshiyuki Takezawa Hiroshima City University, JAPAN nanba@Hiroshima-cu.ac.jp http://www.ls.info.Hiroshima-cu.ac.jp/~nanba
  • 2. ENTER 2018 Research Track Slide Number 2 Purpose • To generate a summary of multiple travel blog entries. • Our method identifies significant sentences in addition to the images.
  • 3. ENTER 2018 Research Track Slide Number 3 Demo http://165.242.101.30/blogMap/
  • 4. ENTER 2018 Research Track Slide Number 4 Related work Travel Information Recommendation •[Wu 08]: selects and shows medias for each type of query (What is the historical background of Tian Tan?)->history category -> text information •[Hao 10]: shows representative tags and snippets for a given destination We generate a text summary with images
  • 5. ENTER 2018 Research Track Slide Number 5 Our Summarization System Input: geographical region and content type 1.Cluster blog entries 2.Calculate the importance of each sentence and image for each cluster 3.Select three to five important sentences and images for each cluster
  • 6. ENTER 2018 Research Track Slide Number 6 Identification of Content Type of Each Blog Entry (Fujii+2016) Content type Criterion Watch Sightseeing for watching enjoyment Experience Experience (scuba diving, dance) Buy Shopping or souvenir stores Dine Drinking and dining Stay Accommodation
  • 7. ENTER 2018 Research Track Slide Number 7 LexRank [Erkan 04] PageRank-base text summarization
  • 8. ENTER 2018 Research Track Slide Number 8 x1 x2 x3 P1 P2 P3                     =           3 2 1 3 2 1 012/1 002/1 100 P P P P P P 1321 =++ PPP           =           5/2 5/1 5/2 3 2 1 P P P
  • 9. ENTER 2018 Research Track Slide Number 9 x1 x2 x3 P1 P2 P3
  • 10. ENTER 2018 Research Track Slide Number 10                               −+           =           3 2 1 3 2 1 002/1 002/1 100 )1( 3/13/13/1 3/13/13/1 3/13/13/1 P P P dd P P P 1321 =++ PPP
  • 11. ENTER 2018 Research Track Slide Number 11 LexRank [Erkan 04] • Make a graph by connecting sentences, whose similarity scores are higher than a threshold value • Then, apply PageRank algorithm to this graph, and select important sentences.
  • 12. ENTER 2018 Research Track Slide Number 12 • Make a graph by connecting images, whose similarity scores are higher than a threshold value • Then, apply PageRank algorithm to this graph, and select important images.
  • 13. ENTER 2018 Research Track Slide Number 13 • Connect both text and image graphs • Then, apply PageRank algorithm to this graph, and select important sentences and images.
  • 14. ENTER 2018 Research Track Slide Number 14 Similarity between items • Similarity between sentences: tfidf Cosine distance • Similarity between images: Color histogram (HSV) Bag of Visual Words: SIFT Cosine distance
  • 15. ENTER 2018 Research Track Slide Number 15 Summarization taking account of content types • Watch: view, beautiful, park… • Dine: tasty, noodle…
  • 16. ENTER 2018 Research Track Slide Number 16                               −+           =           3 2 1 3 2 1 002/1 002/1 100 )1( 3/13/13/1 3/13/13/1 3/13/13/1 P P P dd P P P 1321 =++ PPP
  • 17. ENTER 2018 Research Track Slide Number 17 • a A sentence that have a strong relationship with a given content type
  • 18. ENTER 2018 Research Track Slide Number 18 Experiments Data •Manually created summaries for 20 spots. Evaluation measure •ROUGE-N •Ranking (MANUAL-TEXT, MANUAL-IMAGE-TEXT) Alternatives •Lead (baserline) •LexRank (baseline) •LR+IMG •LR+IMG+TYPE •LR+IMG+TYPE
  • 19. ENTER 2018 Research Track Slide Number 19 Evaluation by ROUGE-N (automatic evaluation) ROUGE-1 ROUGE-2 LexRank (baseline) 0.316 0.207 IR+IMG 0.331 0.227 LR+TYPE 0.345 0.240 IR+IMG+TYPE 0.340 0.237
  • 20. ENTER 2018 Research Track Slide Number 20 Manual Evaluation MANUAL-TEXT Human-produced 1.28 Lead (baseline) 4.01 LexRank (baseline) 3.09 LR+IMG 2.85 LR+TYPE 3.22 IR+IMG+TYPE 2.99
  • 21. ENTER 2018 Research Track Slide Number 24 Conclusions • We propose a method of summarizing multiple travel blog entries. • By connecting a text and an image graph-> content+image was improved • By taking account of content type -> more accurate content type-biased summary • We also constructed a summarization system • http://www.ls.info.Hiroshima-cu.ac.jp/blogMap/