SlideShare a Scribd company logo
1 of 28
Download to read offline
Learning Contextual Representations
for Semantic Parsing with
Generation-Augmented Pre-Training
2021.01.24
딥러닝 논문읽기모임 자연어처리팀
팀원 : 황소현(발표자), 박희수, 신동진, 진명훈
https://arxiv.org/pdf/2012.10309v1.pdf
Introduction
Models
Experiments
- Motivation
- Pain Point
- Pre-training
- Pre-training Data Generation
- Tasks, Datasets and Baseline Systems
- Impact of Learning Objectives
- Analysis of Pre-Training Inputs
- Analysis of Pre-trained Model
Introduction - Motivation
Text-to-SQL
기존의 MLM을 text-to-SQL에 적용하는 것에는 크게 세가지 문제가 있다
Introduction - Pain Point1
Pain Point 1. Text 에서 컬럼의 이름이 발화된 것을 탐지하는 것에 실패하는 문제
Introduction - Pain Point2
Pain Point 2. 셀 값으로 부터 컬럼의 이름을 추론하는 것에 실패하는 문제
(e.g. republics 단어로 부터 GovernmentForm 컬럼 명을 추론하는데 실패함)
Introduction - Pain Point3
Pain Point 3. 복잡한 SQL 생성문 실패
(e.g. 위의 구문에서 semester_id는 괄호 안 SELECT에 포함되어야함)
Introduction - Motivation
이 논문에선 Alignment 뿐 아니라 컬럼 이름의 “Contextual Representation” 을 학습함으로써 더 성
능을 높이는 방법을 제안한다.
(1) Column prediction task (CPed)
각각의 컬럼에 Label index 를 붙이고 utterance 에서 사용이 되었는지 아닌지를 prediction 하는
task.
(2) Column recovery task (CRec)
Text 에서 컬럼 명을 컬럼의 value 로 대체하고 컬럼 명을 추론하는 task.
(3) SQL generation (GenSQL)
utterance 와 schema 를 주고 SQL 문을 생성하는 task. (이 때 synthetic data 를 통하여 많은 양
의 data 를 확보 후에 이를 바탕으로 학습시킴)
Introduction - Motivation
⚫ Main contributions
✓ 3가지의 기존 문제점 (3 pain point)
✓ multiple pre-training tasks and synthetic data 제안
✓ 3가지의 learning objectives that alleviate the three main issues spotted with
pre-trained LMs for semantic parsing.
✓ leveraging SQL-to-Text and Table-to-Text generative models to generate
synthetic data for learning joint representations of textual data and table
schema.
Models - GAP framework
Models - Pre-training
schema S = tables T = {t1, t2, ..., t∣T∣}
columns C = {c1, c2, ..., c∣C∣}
utterance U = {x1, x2, ..., xn}
Models - Text-to-SQL semantic parser
Utterance (raw text)
Schema
Columns Tables
Transformer Encoder
Transformer Decoder
Contextual
Representation
1. Column Prediction (CPred)
2. Column Recovery (CRec)
3. SQL Generation (GenSQL)
4. Masked Language Model (MLM)
Utterance Schema SQL
Models - Text-to-SQL semantic parser
Utterance (raw text)
Schema
Columns Tables
Transformer Encoder
Transformer Decoder
Contextual
Representation
1. Column Prediction (CPred)
MLP
Column in
Utterance?
Utterance Schema SQL
Models - Text-to-SQL semantic parser
Utterance (raw text)
Schema
Columns Tables
Transformer Encoder
Transformer Decoder
Contextual
Representation
2. Column Recovery (CRec)
Column
name
first_name last_name job
Cell value
(record)
Blake Smith manager
first_name last_name job
first_name last_name manager
replacement
Utterance Schema SQL
Models - Text-to-SQL semantic parser
Utterance (raw text)
Schema
Columns Tables
Transformer Encoder
Transformer Decoder
Contextual
Representation
3. SQL Generation (GenSQL)
Utterance Schema SQL
Models - Text-to-SQL semantic parser
Utterance (Masked)
Schema
Columns
(Masked)
Tables
Transformer Encoder
Transformer Decoder
Contextual
Representation
4. Masked Language Model (MLM)
Utterance Schema SQL
Models - Pre-training Data Generation
Models - Pre-training Data Generation
1. Utterance - Schema – SQL (SQL–to–Text Generation)
⚫ SQL (crawl from Github)
→ BART tokenizer
→ SQL-to-Text (train from SPIDER)
→ Utterance
⚫ Result
→ Utterance
→ Schema (columns & tables)
→ SQL
Models - Pre-training Data Generation
2. Utterance – Table (Table–to–Text Generation)
⚫ Table (from English Wikipedia)
→ sample columns
→ BART generator (train from SPIDER)
→ SQL + control codes
⚫ Result
→ Utterance
→ Table
Questions?
Experiments - Tasks, Datasets and Baseline Systems
⚫ SPIDER
SPIDER dataset : Yu et al. 2018,https://www.aclweb.org/anthology/D18-1425/
✓ Annotated 10,181개의 parallel utterance-database-SQL
triples
✓ Metric은 Exact Set Match Accuracy
✓ Baseline: RAT-SQL
8-layer relation-aware transformer를 활용, 테이블 및 발화간
의 연결을 모델링(Wang et al. 2019, https://www.aclweb.org/anthology/2020.acl-
main.677/)
✓ Ablation Study를 위해 IRNet도 추가로 사용
(Gue et al. 2019, https://arxiv.org/abs/1905.08205)
✓ 위 baseline의 original context encoder를 대체하여 GAP
Model의 Encoder를 기본 parser로 확장
Experiments - Tasks, Datasets and Baseline Systems
⚫ Spider Results
Synchronized CFG grammar 사용
Large Scale data-to-text dataset 사용
더 많은 Parameter와 Crowd, Expert Annotated 작업 필요!
Experiments - Tasks, Datasets and Baseline Systems
⚫ Criteria-to-SQL
✓ Electronic Health Record DB에서 임상 시험에 적합한 환자를 쉽게 검색할 수 있는 Dataset
Task는 적합 판정 text를 SQL Queries로 변환시키는 것
✓ Annotated 2003 examples
✓ Metric은 SQL Accuracy와 Execution Accurarcy
✓ Baseline은 Yu et al. 2020c https://www.aclweb.org/anthology/2020.lrec-1.714/에 나온 system 사용
✓ YXJ model이라 부를 것이며 sketch를 디자인하기 위해 사전 문법 지식을 활용하는 slot-filling 기반
모델이라고 함 (BERT 기반을 context encoder로 사용)
“” Any infection requiring parenteral antibiotic therapy or causing fever (i.e.,
temperature > 100.5f) <= 7 days prior to registration “””
SELECT id
FROM records
WHERE active_infection = 1
AND (parenteral_antibiotic_therapy = 1
OR causing_fever = 1 OR temperature > 100.5)
TEXT
SQL
Experiments - Tasks, Datasets and Baseline Systems
⚫ Criteria-to-SQL Results
Experiments - Impact of Learning Objectives
⚫ Four different learning objectives
✓ Masked Language Model (MLM)
✓ Column Prediction (CPred)
✓ Column Recovery (CRec)
✓ SQL Generation (GenSQL)
⚫ Two different conditions
✓ GenSQL learning objective
and the other one is without
Experiments - Analysis of Pre-Training Inputs
⚫ Whether to use utterance in pre-training
✓ MLM → +0.2%
✓ MLM without utterance → -1.7%
✓ CRec without utterance → -1.7%
⚫ Whether to use schema in pre-training
✓ MLM without schema → -1.6%
☞ 작업에 공동 발화 및 스키마 표현을 학습해야 함.
⚫ Whether to use the generated text or the surrounding text of the table
✓ CRec → -0.8%
Experiments - Analysis of Pre-trained Model
Conclusion
⚫ SPIDER
⚫ CRITERIA-TO-SQL
Questions?

More Related Content

Similar to Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Overview of query evaluation
Overview of query evaluationOverview of query evaluation
Overview of query evaluationavniS
 
Analysing Performance of Algorithmic SQL and PLSQL.pptx
Analysing Performance of Algorithmic SQL and PLSQL.pptxAnalysing Performance of Algorithmic SQL and PLSQL.pptx
Analysing Performance of Algorithmic SQL and PLSQL.pptxBrendan Furey
 
NOSQL and Cassandra
NOSQL and CassandraNOSQL and Cassandra
NOSQL and Cassandrarantav
 
Sedna XML Database: Query Parser & Optimizing Rewriter
Sedna XML Database: Query Parser & Optimizing RewriterSedna XML Database: Query Parser & Optimizing Rewriter
Sedna XML Database: Query Parser & Optimizing RewriterIvan Shcheklein
 
DSA 1- Introduction.pdf
DSA 1- Introduction.pdfDSA 1- Introduction.pdf
DSA 1- Introduction.pdfAliyanAbbas1
 
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019Mail.ru Group
 
Arrays and lists in sql server 2008
Arrays and lists in sql server 2008Arrays and lists in sql server 2008
Arrays and lists in sql server 2008nxthuong
 
Extracting data from text documents using the regex
Extracting data from text documents using the regexExtracting data from text documents using the regex
Extracting data from text documents using the regexSteve Mylroie
 
Automated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsAutomated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsNat Rice
 
Information Extraction from HTML: General Machine Learning ...
Information Extraction from HTML: General Machine Learning ...Information Extraction from HTML: General Machine Learning ...
Information Extraction from HTML: General Machine Learning ...butest
 
Ms sql server architecture
Ms sql server architectureMs sql server architecture
Ms sql server architectureAjeet Singh
 
Oracle sql tutorial
Oracle sql tutorialOracle sql tutorial
Oracle sql tutorialMohd Tousif
 
MIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome MeasuresMIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome MeasuresSteven Johnson
 
T3 mp tick and flick
T3 mp tick and flickT3 mp tick and flick
T3 mp tick and flickhccit
 
SQL Server Select Topics
SQL Server Select TopicsSQL Server Select Topics
SQL Server Select TopicsJay Coskey
 
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...TAISEEREISA
 
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...united global soft
 

Similar to Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training (20)

Overview of query evaluation
Overview of query evaluationOverview of query evaluation
Overview of query evaluation
 
Analysing Performance of Algorithmic SQL and PLSQL.pptx
Analysing Performance of Algorithmic SQL and PLSQL.pptxAnalysing Performance of Algorithmic SQL and PLSQL.pptx
Analysing Performance of Algorithmic SQL and PLSQL.pptx
 
NOSQL and Cassandra
NOSQL and CassandraNOSQL and Cassandra
NOSQL and Cassandra
 
Sedna XML Database: Query Parser & Optimizing Rewriter
Sedna XML Database: Query Parser & Optimizing RewriterSedna XML Database: Query Parser & Optimizing Rewriter
Sedna XML Database: Query Parser & Optimizing Rewriter
 
DSA 1- Introduction.pdf
DSA 1- Introduction.pdfDSA 1- Introduction.pdf
DSA 1- Introduction.pdf
 
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019
Оформление пайплайна в NLP проекте​, Виталий Радченко. 22 июня, 2019
 
Arrays and lists in sql server 2008
Arrays and lists in sql server 2008Arrays and lists in sql server 2008
Arrays and lists in sql server 2008
 
Day5
Day5Day5
Day5
 
Extracting data from text documents using the regex
Extracting data from text documents using the regexExtracting data from text documents using the regex
Extracting data from text documents using the regex
 
Automated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsAutomated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language Models
 
Information Extraction from HTML: General Machine Learning ...
Information Extraction from HTML: General Machine Learning ...Information Extraction from HTML: General Machine Learning ...
Information Extraction from HTML: General Machine Learning ...
 
Ms sql server architecture
Ms sql server architectureMs sql server architecture
Ms sql server architecture
 
Oracle sql tutorial
Oracle sql tutorialOracle sql tutorial
Oracle sql tutorial
 
SQL
SQLSQL
SQL
 
MIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome MeasuresMIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome Measures
 
Sql lite android
Sql lite androidSql lite android
Sql lite android
 
T3 mp tick and flick
T3 mp tick and flickT3 mp tick and flick
T3 mp tick and flick
 
SQL Server Select Topics
SQL Server Select TopicsSQL Server Select Topics
SQL Server Select Topics
 
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...
Chapter 3.pptx Oracle SQL or local Android database setup SQL, SQL-Lite, codi...
 
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...
Oracle DBA Online Training in India,Oracle DBA Training in USA,Oracle DBA Tra...
 

More from taeseon ryu

OpineSum Entailment-based self-training for abstractive opinion summarization...
OpineSum Entailment-based self-training for abstractive opinion summarization...OpineSum Entailment-based self-training for abstractive opinion summarization...
OpineSum Entailment-based self-training for abstractive opinion summarization...taeseon ryu
 
3D Gaussian Splatting
3D Gaussian Splatting3D Gaussian Splatting
3D Gaussian Splattingtaeseon ryu
 
Hyperbolic Image Embedding.pptx
Hyperbolic  Image Embedding.pptxHyperbolic  Image Embedding.pptx
Hyperbolic Image Embedding.pptxtaeseon ryu
 
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정taeseon ryu
 
LLaMA Open and Efficient Foundation Language Models - 230528.pdf
LLaMA Open and Efficient Foundation Language Models - 230528.pdfLLaMA Open and Efficient Foundation Language Models - 230528.pdf
LLaMA Open and Efficient Foundation Language Models - 230528.pdftaeseon ryu
 
Dataset Distillation by Matching Training Trajectories
Dataset Distillation by Matching Training Trajectories Dataset Distillation by Matching Training Trajectories
Dataset Distillation by Matching Training Trajectories taeseon ryu
 
Packed Levitated Marker for Entity and Relation Extraction
Packed Levitated Marker for Entity and Relation ExtractionPacked Levitated Marker for Entity and Relation Extraction
Packed Levitated Marker for Entity and Relation Extractiontaeseon ryu
 
MOReL: Model-Based Offline Reinforcement Learning
MOReL: Model-Based Offline Reinforcement LearningMOReL: Model-Based Offline Reinforcement Learning
MOReL: Model-Based Offline Reinforcement Learningtaeseon ryu
 
Scaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language ModelsScaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language Modelstaeseon ryu
 
Visual prompt tuning
Visual prompt tuningVisual prompt tuning
Visual prompt tuningtaeseon ryu
 
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdfvariBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdftaeseon ryu
 
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdfReinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdftaeseon ryu
 
The Forward-Forward Algorithm
The Forward-Forward AlgorithmThe Forward-Forward Algorithm
The Forward-Forward Algorithmtaeseon ryu
 
Towards Robust and Reproducible Active Learning using Neural Networks
Towards Robust and Reproducible Active Learning using Neural NetworksTowards Robust and Reproducible Active Learning using Neural Networks
Towards Robust and Reproducible Active Learning using Neural Networkstaeseon ryu
 
BRIO: Bringing Order to Abstractive Summarization
BRIO: Bringing Order to Abstractive SummarizationBRIO: Bringing Order to Abstractive Summarization
BRIO: Bringing Order to Abstractive Summarizationtaeseon ryu
 

More from taeseon ryu (20)

VoxelNet
VoxelNetVoxelNet
VoxelNet
 
OpineSum Entailment-based self-training for abstractive opinion summarization...
OpineSum Entailment-based self-training for abstractive opinion summarization...OpineSum Entailment-based self-training for abstractive opinion summarization...
OpineSum Entailment-based self-training for abstractive opinion summarization...
 
3D Gaussian Splatting
3D Gaussian Splatting3D Gaussian Splatting
3D Gaussian Splatting
 
JetsonTX2 Python
 JetsonTX2 Python  JetsonTX2 Python
JetsonTX2 Python
 
Hyperbolic Image Embedding.pptx
Hyperbolic  Image Embedding.pptxHyperbolic  Image Embedding.pptx
Hyperbolic Image Embedding.pptx
 
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정
MCSE_Multimodal Contrastive Learning of Sentence Embeddings_변현정
 
LLaMA Open and Efficient Foundation Language Models - 230528.pdf
LLaMA Open and Efficient Foundation Language Models - 230528.pdfLLaMA Open and Efficient Foundation Language Models - 230528.pdf
LLaMA Open and Efficient Foundation Language Models - 230528.pdf
 
YOLO V6
YOLO V6YOLO V6
YOLO V6
 
Dataset Distillation by Matching Training Trajectories
Dataset Distillation by Matching Training Trajectories Dataset Distillation by Matching Training Trajectories
Dataset Distillation by Matching Training Trajectories
 
RL_UpsideDown
RL_UpsideDownRL_UpsideDown
RL_UpsideDown
 
Packed Levitated Marker for Entity and Relation Extraction
Packed Levitated Marker for Entity and Relation ExtractionPacked Levitated Marker for Entity and Relation Extraction
Packed Levitated Marker for Entity and Relation Extraction
 
MOReL: Model-Based Offline Reinforcement Learning
MOReL: Model-Based Offline Reinforcement LearningMOReL: Model-Based Offline Reinforcement Learning
MOReL: Model-Based Offline Reinforcement Learning
 
Scaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language ModelsScaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language Models
 
Visual prompt tuning
Visual prompt tuningVisual prompt tuning
Visual prompt tuning
 
mPLUG
mPLUGmPLUG
mPLUG
 
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdfvariBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf
variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf
 
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdfReinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf
Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf
 
The Forward-Forward Algorithm
The Forward-Forward AlgorithmThe Forward-Forward Algorithm
The Forward-Forward Algorithm
 
Towards Robust and Reproducible Active Learning using Neural Networks
Towards Robust and Reproducible Active Learning using Neural NetworksTowards Robust and Reproducible Active Learning using Neural Networks
Towards Robust and Reproducible Active Learning using Neural Networks
 
BRIO: Bringing Order to Abstractive Summarization
BRIO: Bringing Order to Abstractive SummarizationBRIO: Bringing Order to Abstractive Summarization
BRIO: Bringing Order to Abstractive Summarization
 

Recently uploaded

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

  • 1. Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training 2021.01.24 딥러닝 논문읽기모임 자연어처리팀 팀원 : 황소현(발표자), 박희수, 신동진, 진명훈 https://arxiv.org/pdf/2012.10309v1.pdf
  • 2. Introduction Models Experiments - Motivation - Pain Point - Pre-training - Pre-training Data Generation - Tasks, Datasets and Baseline Systems - Impact of Learning Objectives - Analysis of Pre-Training Inputs - Analysis of Pre-trained Model
  • 3. Introduction - Motivation Text-to-SQL 기존의 MLM을 text-to-SQL에 적용하는 것에는 크게 세가지 문제가 있다
  • 4. Introduction - Pain Point1 Pain Point 1. Text 에서 컬럼의 이름이 발화된 것을 탐지하는 것에 실패하는 문제
  • 5. Introduction - Pain Point2 Pain Point 2. 셀 값으로 부터 컬럼의 이름을 추론하는 것에 실패하는 문제 (e.g. republics 단어로 부터 GovernmentForm 컬럼 명을 추론하는데 실패함)
  • 6. Introduction - Pain Point3 Pain Point 3. 복잡한 SQL 생성문 실패 (e.g. 위의 구문에서 semester_id는 괄호 안 SELECT에 포함되어야함)
  • 7. Introduction - Motivation 이 논문에선 Alignment 뿐 아니라 컬럼 이름의 “Contextual Representation” 을 학습함으로써 더 성 능을 높이는 방법을 제안한다. (1) Column prediction task (CPed) 각각의 컬럼에 Label index 를 붙이고 utterance 에서 사용이 되었는지 아닌지를 prediction 하는 task. (2) Column recovery task (CRec) Text 에서 컬럼 명을 컬럼의 value 로 대체하고 컬럼 명을 추론하는 task. (3) SQL generation (GenSQL) utterance 와 schema 를 주고 SQL 문을 생성하는 task. (이 때 synthetic data 를 통하여 많은 양 의 data 를 확보 후에 이를 바탕으로 학습시킴)
  • 8. Introduction - Motivation ⚫ Main contributions ✓ 3가지의 기존 문제점 (3 pain point) ✓ multiple pre-training tasks and synthetic data 제안 ✓ 3가지의 learning objectives that alleviate the three main issues spotted with pre-trained LMs for semantic parsing. ✓ leveraging SQL-to-Text and Table-to-Text generative models to generate synthetic data for learning joint representations of textual data and table schema.
  • 9. Models - GAP framework
  • 10. Models - Pre-training schema S = tables T = {t1, t2, ..., t∣T∣} columns C = {c1, c2, ..., c∣C∣} utterance U = {x1, x2, ..., xn}
  • 11. Models - Text-to-SQL semantic parser Utterance (raw text) Schema Columns Tables Transformer Encoder Transformer Decoder Contextual Representation 1. Column Prediction (CPred) 2. Column Recovery (CRec) 3. SQL Generation (GenSQL) 4. Masked Language Model (MLM) Utterance Schema SQL
  • 12. Models - Text-to-SQL semantic parser Utterance (raw text) Schema Columns Tables Transformer Encoder Transformer Decoder Contextual Representation 1. Column Prediction (CPred) MLP Column in Utterance? Utterance Schema SQL
  • 13. Models - Text-to-SQL semantic parser Utterance (raw text) Schema Columns Tables Transformer Encoder Transformer Decoder Contextual Representation 2. Column Recovery (CRec) Column name first_name last_name job Cell value (record) Blake Smith manager first_name last_name job first_name last_name manager replacement Utterance Schema SQL
  • 14. Models - Text-to-SQL semantic parser Utterance (raw text) Schema Columns Tables Transformer Encoder Transformer Decoder Contextual Representation 3. SQL Generation (GenSQL) Utterance Schema SQL
  • 15. Models - Text-to-SQL semantic parser Utterance (Masked) Schema Columns (Masked) Tables Transformer Encoder Transformer Decoder Contextual Representation 4. Masked Language Model (MLM) Utterance Schema SQL
  • 16. Models - Pre-training Data Generation
  • 17. Models - Pre-training Data Generation 1. Utterance - Schema – SQL (SQL–to–Text Generation) ⚫ SQL (crawl from Github) → BART tokenizer → SQL-to-Text (train from SPIDER) → Utterance ⚫ Result → Utterance → Schema (columns & tables) → SQL
  • 18. Models - Pre-training Data Generation 2. Utterance – Table (Table–to–Text Generation) ⚫ Table (from English Wikipedia) → sample columns → BART generator (train from SPIDER) → SQL + control codes ⚫ Result → Utterance → Table
  • 20. Experiments - Tasks, Datasets and Baseline Systems ⚫ SPIDER SPIDER dataset : Yu et al. 2018,https://www.aclweb.org/anthology/D18-1425/ ✓ Annotated 10,181개의 parallel utterance-database-SQL triples ✓ Metric은 Exact Set Match Accuracy ✓ Baseline: RAT-SQL 8-layer relation-aware transformer를 활용, 테이블 및 발화간 의 연결을 모델링(Wang et al. 2019, https://www.aclweb.org/anthology/2020.acl- main.677/) ✓ Ablation Study를 위해 IRNet도 추가로 사용 (Gue et al. 2019, https://arxiv.org/abs/1905.08205) ✓ 위 baseline의 original context encoder를 대체하여 GAP Model의 Encoder를 기본 parser로 확장
  • 21. Experiments - Tasks, Datasets and Baseline Systems ⚫ Spider Results Synchronized CFG grammar 사용 Large Scale data-to-text dataset 사용 더 많은 Parameter와 Crowd, Expert Annotated 작업 필요!
  • 22. Experiments - Tasks, Datasets and Baseline Systems ⚫ Criteria-to-SQL ✓ Electronic Health Record DB에서 임상 시험에 적합한 환자를 쉽게 검색할 수 있는 Dataset Task는 적합 판정 text를 SQL Queries로 변환시키는 것 ✓ Annotated 2003 examples ✓ Metric은 SQL Accuracy와 Execution Accurarcy ✓ Baseline은 Yu et al. 2020c https://www.aclweb.org/anthology/2020.lrec-1.714/에 나온 system 사용 ✓ YXJ model이라 부를 것이며 sketch를 디자인하기 위해 사전 문법 지식을 활용하는 slot-filling 기반 모델이라고 함 (BERT 기반을 context encoder로 사용) “” Any infection requiring parenteral antibiotic therapy or causing fever (i.e., temperature > 100.5f) <= 7 days prior to registration “”” SELECT id FROM records WHERE active_infection = 1 AND (parenteral_antibiotic_therapy = 1 OR causing_fever = 1 OR temperature > 100.5) TEXT SQL
  • 23. Experiments - Tasks, Datasets and Baseline Systems ⚫ Criteria-to-SQL Results
  • 24. Experiments - Impact of Learning Objectives ⚫ Four different learning objectives ✓ Masked Language Model (MLM) ✓ Column Prediction (CPred) ✓ Column Recovery (CRec) ✓ SQL Generation (GenSQL) ⚫ Two different conditions ✓ GenSQL learning objective and the other one is without
  • 25. Experiments - Analysis of Pre-Training Inputs ⚫ Whether to use utterance in pre-training ✓ MLM → +0.2% ✓ MLM without utterance → -1.7% ✓ CRec without utterance → -1.7% ⚫ Whether to use schema in pre-training ✓ MLM without schema → -1.6% ☞ 작업에 공동 발화 및 스키마 표현을 학습해야 함. ⚫ Whether to use the generated text or the surrounding text of the table ✓ CRec → -0.8%
  • 26. Experiments - Analysis of Pre-trained Model