SlideShare a Scribd company logo
Deep Learning for
Product Title Summarization
Joan Xiao
Lead Machine Learning Scientist
Figure Eight
Nov. 14 2018
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Our Mission
Figure Eight is the essential Human-in-the-Loop AI platform for data science &
machine learning teams. Our software platform trains, tests, and tunes machine
learning models to make AI work in the real world.
2
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Agenda
• Motivation
• Two deep learning based approaches
• Our implementations
• Results
4
Product Title Summarization
Motivation
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 6
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 8
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Need Short Titles!
9
Original Title:
KROSER Laptop Backpack
Computer Backpack School
Backpack Casual Daypack
Water-Repellent Laptop Bag
with USB Charging Port for
Travel/Business/College/Wo
men/Men Grey
Short Title:
Kroser Laptop
Backpack, Grey
Product Title Summarization
Approach #1
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 11
Original Title:
KROSER Laptop Backpack
Computer Backpack School
Backpack Casual Daypack
Water-Repellent Laptop Bag
with USB Charging Port for
Travel/Business/College/Wo
men/Men Grey
Short Title:
Kroser Laptop
Backpack, Grey
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 12
Original Title:
KROSER Laptop Backpack
Computer Backpack School
Backpack Casual Daypack
Water-Repellent Laptop Bag
with USB Charging Port for
Travel/Business/College/Wom
en/Men Grey
Short Title:
Kroser Laptop Backpack,
Grey
Kroser Brand
Laptop Backpack Function
Grey Variation
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Named Entity Recognition
13
Named entity recognition (NER, also known as entity chunking and entity extraction)
- A subtask of information extraction
- Locates named entities in text and classifies them into pre-defined categories
- Common categories: persons, organizations, locations, quantities, monetary
values, percentages, etc.
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
NER Example
14
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
NER – Traditional Approaches
17
- Rule based: hand-crafted linguistic grammar-based
- Supervised Learning
• Decision Trees
• Maximum Entropy Models
• Support Vector Machines
• Hidden Markov Models
• Conditional Random Fields
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
NER – Open Source APIs
19
- NLTK
- Stanford NLP
- OpenNLP
- SpaCy
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
CNN
28
Kim, Y. (2014). Convolutional Neural Networks for Sentence Classification
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
RNN
29
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
RNN
30
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
RNN
31
Unfolded BLSTM architecture with 3 consecutive steps. From Cui et al. (2017)
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Encoder-decoder RNN
34
http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Encoder-decoder RNN with Attention
35
Neural Machine Translation by Jointly Learning to Align and Translate. From Bahdanau, et al. (2015)
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 36
https://ai.googleblog.com/2016/09/a-neural-network-for-machine.html
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 37
Attention is all you need
Attention is all you need
- Vaswani et. al, 2017
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Pre-trained language models
38
• ELMo
• ULMFiT
• OpenAI Transformer
• BERT
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Deep Learning Results on CoNLL 2003
40
Year Model Architecture Author(s) F1
2018 BiLSTM-CRF+Flair Akbik et al. 93.09
2018 BERT Large Devlin et al. 92.8
2018 CVT + Multi Task Learning Clark et al. 92.6
2018 BERT Base Devlin et al. 92.4
2018 BiLSTM-CRF+ELMo Peters et al. 92.22
2017 GRU-GRU-CRF Yang et al. 91.26
2016 BiLSTM-CNN-CRF Ma and Hovy 91.21
2016 LSTM-LSTM-CRF Lample et al. 90.94
…
2011 CNN-CRF Collbert et al. 88.67
https://github.com/sebastianruder/NLP-progress
Product Title Summarization
Approach #2
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Automatic Text Summarization
42
- The task of producing a concise and fluent summary while
preserving key information content and overall meaning.
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Two Types of Text Summarization
43
- Extractive Summarization
• Extracts key sections of the text, and compose summary without
modifying original text.
- Abstractive Summarization
• Generates a new shorter text that conveys the most critical information
from the original text.
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Text Summarization Example
46
- Original Text: Alice and Bob took the train to visit the zoo. They saw a
baby giraffe, a lion, and a flock of colorful tropical birds.
- Extractive Summary: Alice and Bob visit the zoo. saw a flock of birds.
- Abstractive summary: Alice and Bob visited the zoo and saw animals
and birds.
https://ai.googleblog.com/2016/08/text-summarization-with-tensorflow.html
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Automatic Text Summarization Evaluation Methods
47
- Human Evaluation
- Recall-Oriented Understudy for Gisting Evaluation (ROUGE) 
• Compares a candidate summary to human (reference) summary.
• ROUGE-n: based on comparison of n-grams, where n is 1, 2, 3,
etc. Defined as the number of common n-grams between
candidate and reference summary, divided by the number of n-
grams extracted from the reference summary only.
• ROUGE-L: based on longest common subsequence (LCS) between
the candidate and reference summary.
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Deep Learning Results on Gigaword
(Abstractive Text Summarization )
51
Year Model ROUGE-1 ROUGE-2 ROUGE-L
2018 Re^3 Sum (Cao et al.) 37.04 19.03 34.46
2018 CGU (Lin et al.) 36.3 18.0 33.8
2018 Pointer + Coverage + EntailmentGen +
QuestionGen (Guo et al.)
35.98 17.76 33.63
2018 words-lvt5k-1sent (Nallapti et al.) 36.4 17.7 33.71
2018 Struct+2Way+Word (Song et al.) 35.47 17.66 33.52
https://github.com/sebastianruder/NLP-progress
Product Title Summarization
Implementation
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Implementation Details (NER)
54
- A dataset of about 56K product titles
- Obtain entity labels for NER
- Train NER model
- Predict entities for a title
- Compose a short title from predicted entities
Output KROSER Laptop Backpack, Grey
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Implementation Details (Text Summarization)
55
- Ground truth label for each title is composed of the words corresponding
to the entity tags in NER implementation
- Model predicts short title directly
Output KROSER Laptop Backpack, Grey
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Model Architecture
56
- Bi-directional LSTM encoder/decoder with attention.
• Batch size: 256
• bidirectional encoding layer: 2
• Word embedding size: 128
• LSTM hidden units: 512
• Dropout at embedding and decoder layers: 0.5
• Beam search length: 5
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Evaluation on test set
57
Method ROUGE-1 ROUGE-2
NER 84.71 65.98
Abstract Text Summarization 78.83 47.85
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Human Evaluation
58
• 1000 random samples from test set
• Manual summarization by crowd worker (on Figure Eight)
• Crowd workers rated 4 short titles side-by-side, on the scale from 1 to 10:
o Short title composed from the ground truth label of NER model
o Short title composed from NER model prediction
o Short title from Abstract Text Summarization model
o Human summarization
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Human Evaluation Results (average rating)
59
Ground Truth NER Text Summarization Human Summarization
8.26 ± 1.04 8.16 ± 1.17 6.73 ± 2.24 8.67 ± 1.27
• There is no statistical significance in the difference between the short titles
generated from NER prediction, NER Ground Truth and human summarization.
• The Text Summarization results are significantly below the other 3 versions.
Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved.
Summary
• Reviewed NER and Text Summarization approaches and the latest advancements
• How to summarize product titles using NER and Text Summarization
• Evaluation results show that NER performs much better than Text Summarization
61
Thank You
Joan.xiao@figure-eight.com
https://linked.in/joanxiao

More Related Content

Similar to Deep learning for product title summarization

Sentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics PlatformSentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics Platform
KNIMESlides
 
Interactive and reproducible data analysis with the open-source KNIME Analyti...
Interactive and reproducible data analysis with the open-source KNIME Analyti...Interactive and reproducible data analysis with the open-source KNIME Analyti...
Interactive and reproducible data analysis with the open-source KNIME Analyti...
Greg Landrum
 
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET Journal
 
House Price Prediction Using Machine Learning Via Data Analysis
House Price Prediction Using Machine Learning Via Data AnalysisHouse Price Prediction Using Machine Learning Via Data Analysis
House Price Prediction Using Machine Learning Via Data Analysis
IRJET Journal
 
Ml product page
Ml product pageMl product page
Ml product page
Janu Jahnavi
 
Ml product page
Ml product pageMl product page
Ml product page
Janu Jahnavi
 
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
TigerGraph
 
Freenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning PlatformFreenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning Platform
Brandon White
 
IRJET- Sign Language Converter using OpenCV
IRJET- Sign Language Converter using OpenCVIRJET- Sign Language Converter using OpenCV
IRJET- Sign Language Converter using OpenCV
IRJET Journal
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
Dr. Haxel Consult
 
Optimal Meeting Point Notification for Moving groups of Users in Network Region
Optimal Meeting Point Notification for Moving groups of Users in Network RegionOptimal Meeting Point Notification for Moving groups of Users in Network Region
Optimal Meeting Point Notification for Moving groups of Users in Network Region
IRJET Journal
 
Master the RETE algorithm
Master the RETE algorithmMaster the RETE algorithm
Master the RETE algorithm
Masahiko Umeno
 
Ashisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_docAshisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_doc
ashis deb
 
Ashisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_docAshisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_doc
ashis deb
 
When Deep Learning Meets Recommender System
When Deep Learning Meets Recommender SystemWhen Deep Learning Meets Recommender System
When Deep Learning Meets Recommender System
Asi Messica
 
Webinar: Deep Learning Pipelines Beyond the Learning
Webinar: Deep Learning Pipelines Beyond the LearningWebinar: Deep Learning Pipelines Beyond the Learning
Webinar: Deep Learning Pipelines Beyond the Learning
Mesosphere Inc.
 
REAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTIONREAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTION
IRJET Journal
 
IRJET- Movie Success Prediction using Popularity Factor from Social Media
IRJET- Movie Success Prediction using Popularity Factor from Social MediaIRJET- Movie Success Prediction using Popularity Factor from Social Media
IRJET- Movie Success Prediction using Popularity Factor from Social Media
IRJET Journal
 
Sakshi sharma resume
Sakshi sharma resumeSakshi sharma resume
Sakshi sharma resume
Sakshi Sharma
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
Agnirudra Sikdar
 

Similar to Deep learning for product title summarization (20)

Sentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics PlatformSentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics Platform
 
Interactive and reproducible data analysis with the open-source KNIME Analyti...
Interactive and reproducible data analysis with the open-source KNIME Analyti...Interactive and reproducible data analysis with the open-source KNIME Analyti...
Interactive and reproducible data analysis with the open-source KNIME Analyti...
 
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
 
House Price Prediction Using Machine Learning Via Data Analysis
House Price Prediction Using Machine Learning Via Data AnalysisHouse Price Prediction Using Machine Learning Via Data Analysis
House Price Prediction Using Machine Learning Via Data Analysis
 
Ml product page
Ml product pageMl product page
Ml product page
 
Ml product page
Ml product pageMl product page
Ml product page
 
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
 
Freenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning PlatformFreenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning Platform
 
IRJET- Sign Language Converter using OpenCV
IRJET- Sign Language Converter using OpenCVIRJET- Sign Language Converter using OpenCV
IRJET- Sign Language Converter using OpenCV
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
Optimal Meeting Point Notification for Moving groups of Users in Network Region
Optimal Meeting Point Notification for Moving groups of Users in Network RegionOptimal Meeting Point Notification for Moving groups of Users in Network Region
Optimal Meeting Point Notification for Moving groups of Users in Network Region
 
Master the RETE algorithm
Master the RETE algorithmMaster the RETE algorithm
Master the RETE algorithm
 
Ashisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_docAshisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_doc
 
Ashisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_docAshisdeb analytics new_cv_doc
Ashisdeb analytics new_cv_doc
 
When Deep Learning Meets Recommender System
When Deep Learning Meets Recommender SystemWhen Deep Learning Meets Recommender System
When Deep Learning Meets Recommender System
 
Webinar: Deep Learning Pipelines Beyond the Learning
Webinar: Deep Learning Pipelines Beyond the LearningWebinar: Deep Learning Pipelines Beyond the Learning
Webinar: Deep Learning Pipelines Beyond the Learning
 
REAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTIONREAL ESTATE PRICE PREDICTION
REAL ESTATE PRICE PREDICTION
 
IRJET- Movie Success Prediction using Popularity Factor from Social Media
IRJET- Movie Success Prediction using Popularity Factor from Social MediaIRJET- Movie Success Prediction using Popularity Factor from Social Media
IRJET- Movie Success Prediction using Popularity Factor from Social Media
 
Sakshi sharma resume
Sakshi sharma resumeSakshi sharma resume
Sakshi sharma resume
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
MLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
MLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
MLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
MLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
MLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
MLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
MLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
MLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
MLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
MLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
MLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 

Recently uploaded (20)

Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 

Deep learning for product title summarization

  • 1. Deep Learning for Product Title Summarization Joan Xiao Lead Machine Learning Scientist Figure Eight Nov. 14 2018
  • 2. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Our Mission Figure Eight is the essential Human-in-the-Loop AI platform for data science & machine learning teams. Our software platform trains, tests, and tunes machine learning models to make AI work in the real world. 2
  • 3. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Agenda • Motivation • Two deep learning based approaches • Our implementations • Results 4
  • 5. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 6
  • 6. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 8
  • 7. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Need Short Titles! 9 Original Title: KROSER Laptop Backpack Computer Backpack School Backpack Casual Daypack Water-Repellent Laptop Bag with USB Charging Port for Travel/Business/College/Wo men/Men Grey Short Title: Kroser Laptop Backpack, Grey
  • 9. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 11 Original Title: KROSER Laptop Backpack Computer Backpack School Backpack Casual Daypack Water-Repellent Laptop Bag with USB Charging Port for Travel/Business/College/Wo men/Men Grey Short Title: Kroser Laptop Backpack, Grey
  • 10. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 12 Original Title: KROSER Laptop Backpack Computer Backpack School Backpack Casual Daypack Water-Repellent Laptop Bag with USB Charging Port for Travel/Business/College/Wom en/Men Grey Short Title: Kroser Laptop Backpack, Grey Kroser Brand Laptop Backpack Function Grey Variation
  • 11. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Named Entity Recognition 13 Named entity recognition (NER, also known as entity chunking and entity extraction) - A subtask of information extraction - Locates named entities in text and classifies them into pre-defined categories - Common categories: persons, organizations, locations, quantities, monetary values, percentages, etc.
  • 12. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. NER Example 14
  • 13. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. NER – Traditional Approaches 17 - Rule based: hand-crafted linguistic grammar-based - Supervised Learning • Decision Trees • Maximum Entropy Models • Support Vector Machines • Hidden Markov Models • Conditional Random Fields
  • 14. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. NER – Open Source APIs 19 - NLTK - Stanford NLP - OpenNLP - SpaCy
  • 15. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. CNN 28 Kim, Y. (2014). Convolutional Neural Networks for Sentence Classification
  • 16. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. RNN 29
  • 17. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. RNN 30 http://colah.github.io/posts/2015-08-Understanding-LSTMs/
  • 18. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. RNN 31 Unfolded BLSTM architecture with 3 consecutive steps. From Cui et al. (2017) http://colah.github.io/posts/2015-08-Understanding-LSTMs/
  • 19. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Encoder-decoder RNN 34 http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/
  • 20. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Encoder-decoder RNN with Attention 35 Neural Machine Translation by Jointly Learning to Align and Translate. From Bahdanau, et al. (2015)
  • 21. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 36 https://ai.googleblog.com/2016/09/a-neural-network-for-machine.html
  • 22. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. 37 Attention is all you need Attention is all you need - Vaswani et. al, 2017
  • 23. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Pre-trained language models 38 • ELMo • ULMFiT • OpenAI Transformer • BERT
  • 24. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Deep Learning Results on CoNLL 2003 40 Year Model Architecture Author(s) F1 2018 BiLSTM-CRF+Flair Akbik et al. 93.09 2018 BERT Large Devlin et al. 92.8 2018 CVT + Multi Task Learning Clark et al. 92.6 2018 BERT Base Devlin et al. 92.4 2018 BiLSTM-CRF+ELMo Peters et al. 92.22 2017 GRU-GRU-CRF Yang et al. 91.26 2016 BiLSTM-CNN-CRF Ma and Hovy 91.21 2016 LSTM-LSTM-CRF Lample et al. 90.94 … 2011 CNN-CRF Collbert et al. 88.67 https://github.com/sebastianruder/NLP-progress
  • 26. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Automatic Text Summarization 42 - The task of producing a concise and fluent summary while preserving key information content and overall meaning.
  • 27. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Two Types of Text Summarization 43 - Extractive Summarization • Extracts key sections of the text, and compose summary without modifying original text. - Abstractive Summarization • Generates a new shorter text that conveys the most critical information from the original text.
  • 28. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Text Summarization Example 46 - Original Text: Alice and Bob took the train to visit the zoo. They saw a baby giraffe, a lion, and a flock of colorful tropical birds. - Extractive Summary: Alice and Bob visit the zoo. saw a flock of birds. - Abstractive summary: Alice and Bob visited the zoo and saw animals and birds. https://ai.googleblog.com/2016/08/text-summarization-with-tensorflow.html
  • 29. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Automatic Text Summarization Evaluation Methods 47 - Human Evaluation - Recall-Oriented Understudy for Gisting Evaluation (ROUGE)  • Compares a candidate summary to human (reference) summary. • ROUGE-n: based on comparison of n-grams, where n is 1, 2, 3, etc. Defined as the number of common n-grams between candidate and reference summary, divided by the number of n- grams extracted from the reference summary only. • ROUGE-L: based on longest common subsequence (LCS) between the candidate and reference summary.
  • 30. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Deep Learning Results on Gigaword (Abstractive Text Summarization ) 51 Year Model ROUGE-1 ROUGE-2 ROUGE-L 2018 Re^3 Sum (Cao et al.) 37.04 19.03 34.46 2018 CGU (Lin et al.) 36.3 18.0 33.8 2018 Pointer + Coverage + EntailmentGen + QuestionGen (Guo et al.) 35.98 17.76 33.63 2018 words-lvt5k-1sent (Nallapti et al.) 36.4 17.7 33.71 2018 Struct+2Way+Word (Song et al.) 35.47 17.66 33.52 https://github.com/sebastianruder/NLP-progress
  • 32. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Implementation Details (NER) 54 - A dataset of about 56K product titles - Obtain entity labels for NER - Train NER model - Predict entities for a title - Compose a short title from predicted entities Output KROSER Laptop Backpack, Grey
  • 33. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Implementation Details (Text Summarization) 55 - Ground truth label for each title is composed of the words corresponding to the entity tags in NER implementation - Model predicts short title directly Output KROSER Laptop Backpack, Grey
  • 34. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Model Architecture 56 - Bi-directional LSTM encoder/decoder with attention. • Batch size: 256 • bidirectional encoding layer: 2 • Word embedding size: 128 • LSTM hidden units: 512 • Dropout at embedding and decoder layers: 0.5 • Beam search length: 5
  • 35. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Evaluation on test set 57 Method ROUGE-1 ROUGE-2 NER 84.71 65.98 Abstract Text Summarization 78.83 47.85
  • 36. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Human Evaluation 58 • 1000 random samples from test set • Manual summarization by crowd worker (on Figure Eight) • Crowd workers rated 4 short titles side-by-side, on the scale from 1 to 10: o Short title composed from the ground truth label of NER model o Short title composed from NER model prediction o Short title from Abstract Text Summarization model o Human summarization
  • 37. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Human Evaluation Results (average rating) 59 Ground Truth NER Text Summarization Human Summarization 8.26 ± 1.04 8.16 ± 1.17 6.73 ± 2.24 8.67 ± 1.27 • There is no statistical significance in the difference between the short titles generated from NER prediction, NER Ground Truth and human summarization. • The Text Summarization results are significantly below the other 3 versions.
  • 38. Proprietary and Confidential - Do Not Distribute I © 2018 Figure Eight. All Rights Reserved. Summary • Reviewed NER and Text Summarization approaches and the latest advancements • How to summarize product titles using NER and Text Summarization • Evaluation results show that NER performs much better than Text Summarization 61