1
Understanding and Assessing Language with Neural Network Models
Marek Rei
2
Automated Language Assessment
The number of people learning English around the world is currently
estimated at 1.5 billion and is predicted to exceed 1.9 billion by 2020.
Advantages for students:
• Immediate grades and feedback
• Enables self-assessment and self-tutoring
• Constant availability as an online tool
Advantages for teachers/examiners:
• Reduced teacher/examiner workload
• Can focus on more interesting or difficult content
• Cost-effective approach to assessment
3
Automated Language Assessment
Dear Mrs Brown,
I am writing you because my class want to give a
surprise birthday party for your husband Mr Brown. We
need your help for the details.
First of all could you let us know if the date of June 16th
is all right with his timetable program. We have
organised to do the party between three to six o'clock in
afternoon in College Canteen, about food we organised
a buffet, but could you also help us with the music which
he prefer, if prefer something especialy. We have invite
the student, the teachers and the Principal of school but
we appreciate if you are coming. At last would you tell
us which is the best present for him a compact disk or a
book .
We want say thanks again for your help and you must
be sure that your opinion it would be valuable to us.
I am looking forward to receiving your answer and don't
forget that it is a surprice birthday party.
Yours faithfuly,
Tom
Evaluation:
● Detect any writing errors
● Calculate a holistic writing score
● Predict language proficiency score (IELTS, FCE)
● Detailed analytic scores (e.g., coherence, topic relevance)
Guidance:
● Show detailed progress reports
● Provide corrections for errors
● Suggest areas to focus on
● Generate suitable exercises
4
Talk Overview
01 Error Detection
Identifying the locations of grammatical errors
02 Error Correction
Providing an edited version of an incorrect sentence
03 Essay Scoring
Estimating a language proficiency score based on the full text
04 Applications and Future Directions
How do we make this useful and where do we go next
5
+ DTAL
+ Engineering
+ Cambridge
English
6
Error Detection
7
Error Detection in Learner Writing
I want to thak you for preparing such a nice evening .
8
Error Types in Learner Writing
9
Error Detection in Learner Writing
Spelling error (8.6%): I want to thak you for preparing such a nice evening .
Missing punctuation (7.4%): I know how to cook some things like potatoes .
Incorrect punctuation (7.1%): If you have time , why don’t you meet up .
Incorrect preposition (6.3%): I’m looking forward to seeing you and good luck to your project .
Verb tense error (6.0%): My friend eats two ice creams yesterday .
10
Error Detection in Learner Writing
Word order error (2.8%): We can invite also people who are not members .
Verb agreement error (1.6%): The main material that have been used is dark green glass .
Spelling error produces a valid word (1.5%): I thing you should better save your money .
Incorrectly reproduced idiom (0.5%): And at last but not the least , Captain Davidson showed him ...
Complex error (0.5%): Specially the old castle Wawel's great .
11
Automated Error Detection
1. Experts have hand-annotated a large dataset of learner essays, marking the location of each error.
2. We create algorithms that can look at all these examples and discover regularities through machine learning.
3. We apply the resulting models to new data, where they are able to provide predictions.
12
Deep Learning and Neural Networks
• Highly-connected networks of parameters
• Randomly initialised, but optimised for a specific task during training
• Automatically discovering features that are useful for the task
• Each layer is a function of the previous layer
• Have achieved state-of-the-art results on nearly all language processing tasks
13
Neural Error Detection
Marek Rei and Helen Yannakoudakis (2016) Compositional Sequence Labeling Models for Error Detection
in Learner Writing. ACL 2016.
• Composing words into context-specific representations.
• Predicting a probability distribution over all the possible labels for each word.
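The second point can be illustrated with a minimal sketch. It assumes the network has already produced one raw score per label for each word; the two-label set and the scores below are made up for illustration and are not taken from the paper. A softmax turns each word's scores into a probability distribution over the labels.

```python
import math

LABELS = ["correct", "incorrect"]  # assumed label set, for illustration only


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    highest = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def label_distributions(token_scores):
    """Map each token's raw label scores to a {label: probability} dict."""
    return [dict(zip(LABELS, softmax(scores))) for scores in token_scores]


# Hypothetical raw scores for the tokens "I", "want", "to", "thak"
example_scores = [[2.0, -1.0], [1.5, -0.5], [2.2, -1.3], [-0.8, 1.9]]
distributions = label_distributions(example_scores)
```

In this made-up example the last token receives most of its probability mass on the "incorrect" label, which is how a misspelling such as "thak" would be flagged.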
14
Neural Error Detection
System  FCE    CoNLL14-1  CoNLL14-2
BiLSTM  41.10  16.40      23.90
First Certificate in English dataset (FCE, Yannakoudakis et al. (2011))
● 1,141 manually annotated essays, containing 450K words
● Written by learners during language examinations
● In response to prompts eliciting free-text answers
● Publicly available dataset
Evaluating error detection using F0.5
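F0.5 is the F-beta measure with beta = 0.5, which weights precision more heavily than recall: F = (1 + beta²) · P · R / (beta² · P + R). The sketch below computes it from token-level counts; the function name and the counts in the example are hypothetical.

```python
def f_beta(true_positives, false_positives, false_negatives, beta=0.5):
    """F-beta from token-level counts; beta < 1 favours precision."""
    if true_positives == 0:
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    beta_sq = beta ** 2
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical counts: 60 errors found, 20 false alarms, 60 errors missed
score = f_beta(60, 20, 60)
```

With these counts precision is 0.75 and recall is 0.5. Swapping the two values would give a lower F0.5, which reflects the preference for a system that raises few false alarms.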
15
Additional Training Data
System       FCE    CoNLL14-1  CoNLL14-2
Public FCE   41.10  16.40      23.90
Private CLC  64.30  34.30      44.00
More data = better performance
We can generate artificial data:
Additional training examples for error detection
Idea 1: Randomly generate errors in correct text
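A minimal sketch of Idea 1, under assumptions of my own: errors are limited to replacing a token with a random vocabulary word or swapping it with its neighbour, and every touched token is labelled "i" (incorrect) while the rest are labelled "c" (correct). The function name, the error rate and the vocabulary are all hypothetical.

```python
import random


def corrupt(tokens, vocabulary, error_rate=0.1, rng=None):
    """Randomly replace or swap tokens; return (tokens, labels)."""
    rng = rng or random.Random()
    out = list(tokens)
    labels = ["c"] * len(out)
    i = 0
    while i < len(out):
        if rng.random() < error_rate:
            can_swap = i + 1 < len(out) and out[i] != out[i + 1]
            if can_swap and rng.random() < 0.5:
                # Swap with the next token and mark both as incorrect
                out[i], out[i + 1] = out[i + 1], out[i]
                labels[i] = labels[i + 1] = "i"
                i += 2
                continue
            # Otherwise replace with a different word, if one exists
            candidates = [w for w in vocabulary if w != out[i]]
            if candidates:
                out[i] = rng.choice(candidates)
                labels[i] = "i"
        i += 1
    return out, labels


sentence = "We went shopping on Saturday".split()
noisy, noisy_labels = corrupt(
    sentence, ["the", "of", "in"], error_rate=0.3, rng=random.Random(0)
)
```

Errors produced this way are unlikely to resemble the mistakes learners actually make, which is a reason to consider generation methods that are informed by real error data.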
16
Pattern-based Error Generation
Idea 2: Extract known error patterns and insert them into correct text
We went shop on Saturday
We went shopping on Saturday
VVD shop_VV0 II => VVD shopping_VVG II
I was shopping on Monday
I was shop on Monday
Marek Rei, Mariano Felice, Zheng Yuan and Ted Briscoe (2017) Artificial Error Generation with Machine Translation and
Syntactic Patterns. BEA 2017.
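The pattern on this slide can be applied in reverse to correct text, as in this rough sketch. It assumes the input is already part-of-speech tagged and that both sides of a pattern have the same length; the helper names are hypothetical, and the example sentence and its tags were assigned by hand for illustration.

```python
def parse_side(side):
    """Parse 'VVD shop_VV0 II' into [(word_or_None, tag), ...]."""
    items = []
    for item in side.split():
        word, sep, tag = item.rpartition("_")
        items.append((word if sep else None, tag))
    return items


def apply_pattern(tagged, source, target):
    """Rewrite the first span of `tagged` that matches `source` into `target`."""
    n = len(source)
    for start in range(len(tagged) - n + 1):
        window = tagged[start:start + n]
        if all(tag == p_tag and (p_word is None or word == p_word)
               for (word, tag), (p_word, p_tag) in zip(window, source)):
            rewritten = [(t_word if t_word is not None else word, t_tag)
                         for (word, _), (t_word, t_tag) in zip(window, target)]
            return tagged[:start] + rewritten + tagged[start + n:]
    return None  # the pattern does not occur in this sentence


pattern = "VVD shop_VV0 II => VVD shopping_VVG II"
incorrect_side, correct_side = (parse_side(s) for s in pattern.split("=>"))

# Hand-tagged hypothetical sentence
tagged = [("They", "PPHS2"), ("went", "VVD"), ("shopping", "VVG"),
          ("on", "II"), ("Friday", "NPD1")]
result = apply_pattern(tagged, source=correct_side, target=incorrect_side)
words = " ".join(word for word, _ in result)
```

Matching on the surrounding tags as well as the word restricts the inserted error to contexts similar to the one in which it was originally observed.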
17
Translation-based Error Generation
Idea 3: Train a machine translation model to translate from correct to
incorrect text
ORIG: We are a well-mixed class with equal numbers of boys and girls, all about 20 years old.
PAT: We are a well-mixed class with equal numbers of boys an girls, all about 20 year old.
MT: We are a well-mixed class with equals numbers of boys and girls, all about 20 years old.
Normally translate between languages:
E.g. English to French
Now let’s translate for generating errors:
English to faulty English
Can use off-the-shelf machine translation tools
Marek Rei, Mariano Felice, Zheng Yuan and Ted Briscoe (2017) Artificial Error Generation with Machine Translation and
Syntactic Patterns. BEA 2017.
18
Artificial Error Generation
System   FCE    CoNLL14-1  CoNLL14-2
BiLSTM   41.10  16.40      23.90
+PAT     47.81  19.47      28.49
+MT      48.37  19.73      28.39
+PAT+MT  49.11  21.87      30.13
Training on 450K words of annotated data and 4.5M words of automatically generated data.
19
Error Correction
20
Error Correction
Error detection identifies incorrect words
Error correction modifies a sentence to remove errors
We can formulate correction as a machine translation problem:
Let’s translate from incorrect English to correct English
Returns the highest scoring possible translation
Input: We can invite also people who are not members .
Output: We can also invite people who are not members .
21
Statistical Machine Translation
Text is separated into multi-word units (phrases)
Phrase alignments and translation tables are learned from parallel
datasets
Language models are used to ensure reasonable output
22
Neural Machine Translation
The encoder learns to process the source sentence and produce an
informative vector representation
The decoder learns to generate a sentence in a different language based
on that vector
Bahdanau et al. (2014), figure by Stephen Merity.
23
Handling Unknown Words
Input: I aren’t seen Albert since last summer .
Output: I haven’t seen OOV since last summer .
Neural models have a limited fixed vocabulary and represent other words
as OOV tokens.
Solution:
1) Align the words between the input and output text
2) Translate OOV words in a post-processing step
Zheng Yuan and Ted Briscoe (2016) Grammatical error correction using neural machine translation. NAACL 2016.
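The two-step solution can be sketched as follows, assuming the word alignment is already available as a list that maps each output position to an input position. The function name, the optional word-level lookup table and the fallback of copying the source word are illustrative assumptions rather than the paper's exact procedure.

```python
def restore_oov(source, output, alignment, word_table=None, oov="OOV"):
    """Replace each OOV token using the source word it is aligned to."""
    word_table = word_table or {}
    restored = []
    for position, token in enumerate(output):
        if token == oov:
            source_word = source[alignment[position]]
            # Use a word-level translation if one is known, else copy the word
            token = word_table.get(source_word, source_word)
        restored.append(token)
    return restored


source = "I aren't seen Albert since last summer .".split()
output = "I haven't seen OOV since last summer .".split()
alignment = list(range(len(output)))  # assumed one-to-one alignment
restored = " ".join(restore_oov(source, output, alignment))
```

Copying works well for names such as "Albert"; the lookup table is there for unknown words that are themselves misspelt and need a correction.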
24
Neural Machine Translation
System     FCE    CoNLL14
SMT        52.90  37.33
NMT+align  53.49  39.90
25
N-best List
Original sentence:
There are some informations you have asked me about.
SMT output:
1st There are some information you have asked me about.
2nd There is some information you have asked me about.
3rd There are some information you asked me about.
4th There are some information you have asked me.
5th There are some information you have asked me for.
26
Scoring Candidates
The correction system may not know how to fix an error and may therefore leave it uncorrected.
How can we use the detection model to fix this problem and assign a better score to each “translation”?
+    +        +           +    +       +    -        -
The  theatre  restaurant  was  closed  for  unknown  reason
27
Scoring Candidates
How can we use the detection model to fix this problem and assign a better score to each “translation”?
1.0  1.0      1.0         0.9  1.0     1.0  0.3      0.1
The  theatre  restaurant  was  closed  for  unknown  reason
1. Sentence correctness score: calculated based on the probability of
each of its tokens being correct.
2. Correction recall score: select the translation that has modified the
(maximum number of) words marked by the detection model as
incorrect.
3. Correction agreement score: the ratio of agreed corrections compared
to the disagreed corrections.
Helen Yannakoudakis, Marek Rei, Øistein E. Andersen and Zheng Yuan (2017) Neural Sequence-Labelling Models for
Grammatical Error Correction. EMNLP 2017.
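The three scores could be computed roughly as follows. This is a sketch under several assumptions that the slides do not spell out: correctness is taken as the sum of token log-probabilities, a token counts as detected when its probability of being correct is below 0.5, edits are found with a generic sequence diff, and agreement is expressed as the share of edits that fall on detected tokens. The function names and the candidate sentence are hypothetical.

```python
import math
from difflib import SequenceMatcher


def correctness_score(token_probs):
    """Sum of log-probabilities that each token is correct (an assumption)."""
    return sum(math.log(max(p, 1e-12)) for p in token_probs)


def modified_positions(source, candidate):
    """Indices of source tokens that the candidate edited."""
    modified = set()
    matcher = SequenceMatcher(None, source, candidate, autojunk=False)
    for tag, i1, i2, _, _ in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            modified.update(range(i1, i2))
        elif tag == "insert" and source:
            # Assumption: an insertion counts as editing the following token
            modified.add(min(i1, len(source) - 1))
    return modified


def correction_recall(detected, modified):
    """Share of tokens flagged by the detector that the candidate edited."""
    return len(detected & modified) / len(detected) if detected else 0.0


def correction_agreement(detected, modified):
    """Agreed edits as a share of all edits (a bounded variant of the ratio)."""
    return len(detected & modified) / len(modified) if modified else 0.0


source = "The theatre restaurant was closed for unknown reason".split()
probs = [1.0, 1.0, 1.0, 0.9, 1.0, 1.0, 0.3, 0.1]  # values from the slide
detected = {i for i, p in enumerate(probs) if p < 0.5}  # assumed threshold

# Hypothetical candidate correction from an n-best list
candidate = "The theatre restaurant was closed for unknown reasons".split()
modified = modified_positions(source, candidate)
recall = correction_recall(detected, modified)
agreement = correction_agreement(detected, modified)
source_correctness = correctness_score(probs)
```

In practice the correctness score would be computed on each candidate by running the detection model over it; the source sentence is scored here only because its probabilities are the ones given on the slide.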
28
Neural Machine Translation
System          FCE    CoNLL14
SMT             52.90  37.33
NMT+align       53.49  39.90
Detect+correct  55.60  42.44
29
Error Correction Results
Original sentence:
I work with children an the Computer help my Jop bat affeted to
MT output:
I work with children and the Computer help my Jop bat affeted to
MT+detection output:
I work with children and the computer helps my Jop bat affeted to
30
Error Correction Results
Original sentence:
It takes 25 minutes that is convenient to us
MT output:
It takes 25 minutes that is convenient for us
MT+detection output:
It takes 25 minutes , which is convenient for us
31
Error Correction Results
Original sentence:
I hope that our friend Richard Brown doesn’t have any serious willness
MT output:
I hope that our friend Richard Brown doesn’t have any serious willness
MT+detection output:
I hope that our friend Richard Brown doesn’t have any serious willingness
32
Essay Scoring
33
Essay Scoring
Automatically assign a language proficiency score based on a freeform
short essay.
34
Feature-based Essay Scoring
Extract a number of features:
● Word sequences
○ Unigrams
○ Bigrams
○ Trigrams
● Part-of-speech tags
● Grammatical constructions
● Complexity measures
● Semantic similarity between sentences
● Estimated error count
Helen Yannakoudakis, Ted Briscoe and Ben Medlock (2011) A New Dataset and Method for Automatically Grading ESOL
Texts. ACL 2011.
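The word-sequence features are the simplest to illustrate. The sketch below counts unigrams, bigrams and trigrams in a tokenised text; the function name is hypothetical, and the lowercased example sentence is adapted from the sample letter earlier in the talk.

```python
from collections import Counter


def ngram_counts(tokens, max_n=3):
    """Count all n-grams of length 1 to max_n in a token list."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for start in range(len(tokens) - n + 1):
            counts[tuple(tokens[start:start + n])] += 1
    return counts


tokens = "we need your help for the details".split()
features = ngram_counts(tokens)
```

Each distinct n-gram becomes one dimension of a sparse feature vector, which is then combined with the other feature types listed above.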
35
Feature-based Essay Scoring
Features            Spearman ρ (%)
Word sequences      59.8
+ POS tags          68.7
+ Syntax structure  72.2
+ Error rate        78.5
Human-human         79.2
36
Neural Essay Scoring
Architectures shown: bi-directional LSTM and convolutional network
37
Score-specific Word Embeddings
Optimising word embeddings to:
1) differentiate between correct and randomly corrupted sequences
2) predict the score of the essay where the current word sequence came from
Then use these embeddings in a neural network for essay scoring.
Dimitrios Alikaniotis, Helen Yannakoudakis and Marek Rei (2016) Automatic Text Scoring Using Neural Networks.
ACL 2016.
38
Score-specific Word Embeddings
Pre-training  Spearman ρ (%)  RMSE
None          68              7.31
word2vec      79              3.2
SSWE          91              2.4
Evaluating score-specific word embeddings on the ASAP dataset: 13K marked
essays (150-550 words each).
Using a two-layer bi-directional LSTM for essay scoring.
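Both metrics in the table are straightforward to compute. The sketch below is a dependency-free illustration: Spearman's ρ is the Pearson correlation of the ranks (ties receive their average rank), and RMSE is the root of the mean squared difference between predicted and gold scores. The function names and the scores in the example are made up.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def rmse(predicted, gold):
    """Root mean squared error between predictions and gold scores."""
    return math.sqrt(sum((p - g) ** 2 for p, g in zip(predicted, gold)) / len(gold))


gold = [10, 20, 30, 40]        # hypothetical human scores
predicted = [12, 18, 33, 39]   # hypothetical model scores
rho = spearman(gold, predicted)
error = rmse(predicted, gold)
```

The two metrics are complementary: ρ only checks that essays are ranked in the right order, while RMSE also penalises predictions that are on the wrong scale.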
39
Error-specific Word Embeddings
Taking advantage of the available error annotation in the training data.
Optimising embeddings to detect real errors, as opposed to randomly corrupted sequences.
Network predicts the quality of each word sequence, based on the number of errors it contains.
Youmna Farag, Marek Rei and Ted Briscoe (2017) An Error-Oriented Approach to Word Embedding Pre-Training.
BEA 2017.
40
Error-specific Word Embeddings
Pre-training  Spearman ρ (%)  RMSE
word2vec      56.7            4.9
GloVe         51.8            5.2
SSWE          58.3            4.9
ESWE          63.7            4.5
Evaluating error-specific word embeddings on the FCE dataset.
Using the convolutional network for essay scoring.
41
Future Directions
42
Future Directions
Personalisation: generating exercises that are designed for a specific user
Automated tutoring: active teaching from an automated dialogue system
Speech: evaluating and providing feedback for spoken answers
43
Future Directions
Specialised systems: supervised models targeting specific error types
Multi-task learning: taking better advantage of other tasks and datasets
Multi-modal topics: students writing about images or videos
44
Summary
01 Error detection
Neural sequence labelling architecture
Artificial data generation
02 Error correction
Neural machine translation
Reranking with detection
03 Essay scoring
Feature-based model
Neural essay scoring
Score-specific word embeddings
45
Thank you!
Any questions?

More Related Content

What's hot

1.1 review on algebra 1
1.1 review on algebra 11.1 review on algebra 1
1.1 review on algebra 1math265
 
Derivate
DerivateDerivate
7 cavalieri principle-x
7 cavalieri principle-x7 cavalieri principle-x
7 cavalieri principle-x
math266
 
Ideals and factor rings
Ideals and factor ringsIdeals and factor rings
Ideals and factor rings
dianageorge27
 
28 mac laurin expansions x
28 mac laurin expansions x28 mac laurin expansions x
28 mac laurin expansions x
math266
 
27 power series x
27 power series x27 power series x
27 power series x
math266
 
16 partial fraction decompositions x
16 partial fraction decompositions x16 partial fraction decompositions x
16 partial fraction decompositions x
math266
 
Topology M.Sc. 2 semester Mathematics compactness, unit - 4
Topology M.Sc. 2 semester Mathematics compactness, unit - 4Topology M.Sc. 2 semester Mathematics compactness, unit - 4
Topology M.Sc. 2 semester Mathematics compactness, unit - 4
Shri Shankaracharya College, Bhilai,Junwani
 
Práctica de la Unidad I - Discreta - UTN-FRT
Práctica de la Unidad I - Discreta - UTN-FRTPráctica de la Unidad I - Discreta - UTN-FRT
Práctica de la Unidad I - Discreta - UTN-FRT
Ing. Marisel Bedrán
 
3.6 applications in optimization
3.6 applications in optimization3.6 applications in optimization
3.6 applications in optimizationmath265
 
Relaciones y funciones
Relaciones y funcionesRelaciones y funciones
Relaciones y funciones
Marlon Coti
 
3.3 graphs of factorable polynomials and rational functions
3.3 graphs of factorable polynomials and rational functions3.3 graphs of factorable polynomials and rational functions
3.3 graphs of factorable polynomials and rational functionsmath265
 
Cuantificadores
CuantificadoresCuantificadores
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Edureka!
 
1.2 review on algebra 2-sign charts and inequalities
1.2 review on algebra 2-sign charts and inequalities1.2 review on algebra 2-sign charts and inequalities
1.2 review on algebra 2-sign charts and inequalitiesmath265
 
15 integrals of trig products-i-x
15 integrals of trig products-i-x15 integrals of trig products-i-x
15 integrals of trig products-i-x
math266
 
1.3 review on trig functions
1.3 review on trig functions1.3 review on trig functions
1.3 review on trig functionsmath265
 
Topological Data Analysis and Persistent Homology
Topological Data Analysis and Persistent HomologyTopological Data Analysis and Persistent Homology
Topological Data Analysis and Persistent Homology
Carla Melia
 
20 sequences x
20 sequences x20 sequences x
20 sequences x
math266
 

What's hot (20)

1.1 review on algebra 1
1.1 review on algebra 11.1 review on algebra 1
1.1 review on algebra 1
 
Derivate
DerivateDerivate
Derivate
 
7 cavalieri principle-x
7 cavalieri principle-x7 cavalieri principle-x
7 cavalieri principle-x
 
Ideals and factor rings
Ideals and factor ringsIdeals and factor rings
Ideals and factor rings
 
28 mac laurin expansions x
28 mac laurin expansions x28 mac laurin expansions x
28 mac laurin expansions x
 
27 power series x
27 power series x27 power series x
27 power series x
 
16 partial fraction decompositions x
16 partial fraction decompositions x16 partial fraction decompositions x
16 partial fraction decompositions x
 
Topology M.Sc. 2 semester Mathematics compactness, unit - 4
Topology M.Sc. 2 semester Mathematics compactness, unit - 4Topology M.Sc. 2 semester Mathematics compactness, unit - 4
Topology M.Sc. 2 semester Mathematics compactness, unit - 4
 
Práctica de la Unidad I - Discreta - UTN-FRT
Práctica de la Unidad I - Discreta - UTN-FRTPráctica de la Unidad I - Discreta - UTN-FRT
Práctica de la Unidad I - Discreta - UTN-FRT
 
3.6 applications in optimization
3.6 applications in optimization3.6 applications in optimization
3.6 applications in optimization
 
Relaciones y funciones
Relaciones y funcionesRelaciones y funciones
Relaciones y funciones
 
3.3 graphs of factorable polynomials and rational functions
3.3 graphs of factorable polynomials and rational functions3.3 graphs of factorable polynomials and rational functions
3.3 graphs of factorable polynomials and rational functions
 
Cuantificadores
CuantificadoresCuantificadores
Cuantificadores
 
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
 
1.2 review on algebra 2-sign charts and inequalities
1.2 review on algebra 2-sign charts and inequalities1.2 review on algebra 2-sign charts and inequalities
1.2 review on algebra 2-sign charts and inequalities
 
Matematica
MatematicaMatematica
Matematica
 
15 integrals of trig products-i-x
15 integrals of trig products-i-x15 integrals of trig products-i-x
15 integrals of trig products-i-x
 
1.3 review on trig functions
1.3 review on trig functions1.3 review on trig functions
1.3 review on trig functions
 
Topological Data Analysis and Persistent Homology
Topological Data Analysis and Persistent HomologyTopological Data Analysis and Persistent Homology
Topological Data Analysis and Persistent Homology
 
20 sequences x
20 sequences x20 sequences x
20 sequences x
 

Similar to Grammarly AI-NLP Club #4 - Understanding and assessing language with neural network models - Marek Rei

Neural machine translation of rare words with subword units
Neural machine translation of rare words with subword unitsNeural machine translation of rare words with subword units
Neural machine translation of rare words with subword units
Tae Hwan Jung
 
Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLP
MENGSAYLOEM1
 
Deep network notes.pdf
Deep network notes.pdfDeep network notes.pdf
Deep network notes.pdf
Ramya Nellutla
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLifeng (Aaron) Han
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
Lifeng (Aaron) Han
 
Open vocabulary problem
Open vocabulary problemOpen vocabulary problem
Open vocabulary problem
JaeHo Jang
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine Translation
Gennadi Lembersky
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...
jcscholtes
 
BEA12_sakaguchi
BEA12_sakaguchiBEA12_sakaguchi
BEA12_sakaguchi
Keisuke Sakaguchi
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
Roelof Pieters
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Marcin Junczys-Dowmunt
 
The NLP Muppets revolution!
The NLP Muppets revolution!The NLP Muppets revolution!
The NLP Muppets revolution!
Fabio Petroni, PhD
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for TranslationRIILP
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Kotaro Hara
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
kevig
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
kevig
 
Prompt-Engineering-Lecture-Elvis learn prompt engineering
Prompt-Engineering-Lecture-Elvis learn prompt engineeringPrompt-Engineering-Lecture-Elvis learn prompt engineering
Prompt-Engineering-Lecture-Elvis learn prompt engineering
SaweraKhadium
 
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
Alp Öktem
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
Hemantha Kulathilake
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
Anuj Gupta
 

Similar to Grammarly AI-NLP Club #4 - Understanding and assessing language with neural network models - Marek Rei (20)

Neural machine translation of rare words with subword units
Neural machine translation of rare words with subword unitsNeural machine translation of rare words with subword units
Neural machine translation of rare words with subword units
 
Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLP
 
Deep network notes.pdf
Deep network notes.pdfDeep network notes.pdf
Deep network notes.pdf
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metric
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
Open vocabulary problem
Open vocabulary problemOpen vocabulary problem
Open vocabulary problem
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine Translation
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...
 
BEA12_sakaguchi
BEA12_sakaguchiBEA12_sakaguchi
BEA12_sakaguchi
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
 
The NLP Muppets revolution!
The NLP Muppets revolution!The NLP Muppets revolution!
The NLP Muppets revolution!
 
13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation13. Constantin Orasan (UoW) Natural Language Processing for Translation
13. Constantin Orasan (UoW) Natural Language Processing for Translation
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
 
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGEADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
ADVERSARIAL GRAMMATICAL ERROR GENERATION: APPLICATION TO PERSIAN LANGUAGE
 
Prompt-Engineering-Lecture-Elvis learn prompt engineering
Prompt-Engineering-Lecture-Elvis learn prompt engineeringPrompt-Engineering-Lecture-Elvis learn prompt engineering
Prompt-Engineering-Lecture-Elvis learn prompt engineering
 
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
 

More from Grammarly

Vitalii Braslavskyi - Declarative engineering
Vitalii Braslavskyi - Declarative engineering Vitalii Braslavskyi - Declarative engineering
Vitalii Braslavskyi - Declarative engineering
Grammarly
 
Grammarly AI-NLP Club #10 - Information-Theoretic Probing with Minimum Descri...
Grammarly AI-NLP Club #10 - Information-Theoretic Probing with Minimum Descri...Grammarly AI-NLP Club #10 - Information-Theoretic Probing with Minimum Descri...
Grammarly AI-NLP Club #10 - Information-Theoretic Probing with Minimum Descri...
Grammarly
 
Grammarly AI-NLP Club #9 - Dumpster diving for parallel corpora with efficien...
Grammarly AI-NLP Club #9 - Dumpster diving for parallel corpora with efficien...Grammarly AI-NLP Club #9 - Dumpster diving for parallel corpora with efficien...
Grammarly AI-NLP Club #9 - Dumpster diving for parallel corpora with efficien...
Grammarly
 
Grammarly AI-NLP Club #8 - Arabic Natural Language Processing: Challenges and...
Grammarly AI-NLP Club #8 - Arabic Natural Language Processing: Challenges and...Grammarly AI-NLP Club #8 - Arabic Natural Language Processing: Challenges and...
Grammarly AI-NLP Club #8 - Arabic Natural Language Processing: Challenges and...
Grammarly
 
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly
 
Grammarly AI-NLP Club #5 - Automatic text simplification in the biomedical do...
Grammarly AI-NLP Club #5 - Automatic text simplification in the biomedical do...Grammarly AI-NLP Club #5 - Automatic text simplification in the biomedical do...
Grammarly AI-NLP Club #5 - Automatic text simplification in the biomedical do...
Grammarly
 
Grammarly AI-NLP Club #3 - Learning to Read for Automated Fact Checking - Isa...
Grammarly AI-NLP Club #4 - Understanding and assessing language with neural network models - Marek Rei

  • 1. 1 Understanding and Assessing Language with Neural Network Models Marek Rei
  • 2. 2 Automated Language Assessment The number of people learning English around the world is currently estimated at 1.5 billion and is predicted to exceed 1.9 billion by 2020. Advantages for students: • Immediate grades and feedback • Enables self-assessment and self-tutoring • Constant availability as an online tool Advantages for teachers/examiners: • Reduced teacher/examiner workload • Can focus on more interesting or difficult content • Cost-effective approach to assessment
  • 3. 3 Automated Language Assessment Dear Mrs Brown, I am writing you because my class want to give a surprise birthday party for your husband Mr Brown. We need your help for the details. First of all could you let us know if the date of June 16th is all right with his timetable program. We have organised to do the party between three to six o'clock in afternoon in College Canteen, about food we organised a buffet, but could you also help us with the music which he prefer, if prefer something especialy. We have invite the student, the teachers and the Principal of school but we appreciate if you are coming. At last would you tell us which is the best present for him a compact disk or a book . We want say thanks again for your help and you must be sure that your opinion it would be valuable to us. I am looking forward to receiving your answer and don't forget that it is a surprice birthday party. Yours faithfuly, Tom Evaluation: ● Detect any writing errors ● Calculate a holistic writing score ● Predict language proficiency score (IELTS, FCE) ● Detailed analytic scores (e.g., coherence, topic relevance) Guidance: ● Show detailed progress reports ● Provide corrections for errors ● Suggest areas to focus on ● Generate suitable exercises
  • 4. 4 Talk Overview
      01 Error Detection: Identifying the locations of grammatical errors
      02 Error Correction: Providing an edited version of an incorrect sentence
      03 Essay Scoring: Estimating a language proficiency score based on the full text
      04 Applications and Future Directions: How do we make this useful and where do we go next
  • 5. 5 + DTAL + Engineering + Cambridge English
  • 7. 7 I want to thak you for preparing such a nice evening . Error Detection in Learner Writing
  • 8. 8 Error Types in Learner Writing
  • 9. 9 Error Detection in Learner Writing
      ● Spelling error (8.6%): I want to thak you for preparing such a nice evening .
      ● Missing punctuation (7.4%): I know how to cook some things like potatoes .
      ● Incorrect punctuation (7.1%): If you have time , why don’t you meet up .
      ● Incorrect preposition (6.3%): I’m looking forward to seeing you and good luck to your project .
      ● Verb tense error (6.0%): My friend eats two ice creams yesterday .
  • 10. 10 Error Detection in Learner Writing
      ● Word order error (2.8%): We can invite also people who are not members .
      ● Verb agreement error (1.6%): The main material that have been used is dark green glass .
      ● Spelling error produces a valid word (1.5%): I thing you should better save your money .
      ● Incorrectly reproduced idiom (0.5%): And at last but not the least , Captain Davidson showed him ...
      ● Complex error (0.5%): Specially the old castle Wawel's great .
  • 11. 11 Automated Error Detection
      1. Experts have hand-annotated a large dataset of learner essays, marking the location of each error.
      2. We create algorithms that can look at all these examples and discover regularities through machine learning.
      3. We apply the resulting models on new data, where they are able to provide predictions.
  • 12. 12 Deep Learning and Neural Networks • Highly-connected networks of parameters • Randomly initialised, but optimised for a specific task during training • Automatically discovering features that are useful for the task • Each layer is a function of the previous layer • Have achieved state-of-the-art results on nearly all language processing tasks
  • 13. 13 Neural Error Detection Marek Rei and Helen Yannakoudakis (2016) Compositional Sequence Labeling Models for Error Detection in Learner Writing. ACL 2016. • Composing words into context-specific representations. • Predicting a probability distribution over all the possible labels for each word.
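The last point on this slide, a probability distribution over labels for each word, can be illustrated with a softmax over per-token scores. This is only a minimal sketch: the two-label set and the raw scores are hypothetical stand-ins for whatever the network's output layer produces, and the bidirectional LSTM that would compute those scores is not shown.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def label_distributions(token_scores, labels=("correct", "incorrect")):
    """Map each token's raw scores to a {label: probability} dict.

    The label names are an assumption made for this illustration.
    """
    return [dict(zip(labels, softmax(scores))) for scores in token_scores]

# Hypothetical output-layer scores for two tokens.
dists = label_distributions([[2.0, 0.0], [0.0, 0.0]])
```

Each token receives its own distribution, so a downstream component can threshold the "incorrect" probability or pass it on as a soft signal.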
  • 14. 14 Neural Error Detection
      First Certificate in English dataset (FCE, Yannakoudakis et al. (2011)):
      ● 1,141 manually annotated essays, containing 450K words
      ● Written by learners during language examinations
      ● In response to prompts eliciting free-text answers
      ● Publicly available dataset
      Evaluating error detection using F0.5:
      System    FCE      CoNLL14-1   CoNLL14-2
      BiLSTM    41.10    16.40       23.90
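For reference, F0.5 is the F-beta measure with beta = 0.5, which weights precision more heavily than recall. The sketch below computes it at the token level; the "c"/"i" label names are an assumption for illustration, not necessarily the labels used in the evaluation scripts behind the table.

```python
def detection_counts(gold, predicted, positive="i"):
    """Count true positives, false positives and false negatives per token."""
    tp = sum(1 for g, p in zip(gold, predicted) if g == positive and p == positive)
    fp = sum(1 for g, p in zip(gold, predicted) if g != positive and p == positive)
    fn = sum(1 for g, p in zip(gold, predicted) if g == positive and p != positive)
    return tp, fp, fn

def f_beta(tp, fp, fn, beta=0.5):
    """F-beta score; beta < 1 favours precision over recall."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Hypothetical gold and predicted labels ("i" = incorrect, "c" = correct).
gold = ["c", "i", "c", "i"]
pred = ["c", "i", "i", "c"]
score = f_beta(*detection_counts(gold, pred))
```

With high precision and low recall, F0.5 stays closer to the precision value than F1 would, which suits a feedback tool where false alarms are costly.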
  • 15. 15 Additional Training Data
      Training data   FCE      CoNLL14-1   CoNLL14-2
      Public FCE      41.10    16.40       23.90
      Private CLC     64.30    34.30       44.00
      More data = better performance
      We can generate artificial data: additional training examples for error detection
      Idea 1: Randomly generate errors in correct text
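Idea 1 can be sketched as a random substitution process over correct text. The slide does not say which corruption operations were used, so the substitution-only scheme, the tiny replacement vocabulary, the error rate and the "c"/"i" labels below are all assumptions for illustration.

```python
import random

def corrupt(tokens, error_rate=0.1, vocab=("a", "the", "of", "in", "on"), seed=0):
    """Randomly replace tokens and return (corrupted_tokens, labels).

    Labels are "i" for a replaced token and "c" otherwise, so the output
    can serve directly as extra training data for error detection.
    """
    rng = random.Random(seed)  # seeded for reproducibility
    out, labels = [], []
    for tok in tokens:
        if rng.random() < error_rate:
            # Pick a replacement that differs from the original token.
            out.append(rng.choice([w for w in vocab if w != tok]))
            labels.append("i")
        else:
            out.append(tok)
            labels.append("c")
    return out, labels

sentence = "We went shopping on Saturday".split()
noisy, labels = corrupt(sentence, error_rate=0.3)
```

Purely random errors like these need not resemble the mistakes learners actually make, which is one motivation for pattern-based or translation-based generation.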
  • 16. 16 Pattern-based Error Generation
      Idea 2: Extract known error patterns and insert them into correct text
      Learner sentence:    We went shop on Saturday
      Corrected sentence:  We went shopping on Saturday
      Extracted pattern:   VVD shop_VV0 II => VVD shopping_VVG II
      Correct text:        I was shopping on Monday
      Generated error:     I was shop on Monday
      Marek Rei, Mariano Felice, Zheng Yuan and Ted Briscoe (2017) Artificial Error Generation with Machine Translation and Syntactic Patterns. BEA 2017.
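A minimal sketch of applying one such pattern in reverse, from correct to incorrect text. The pattern representation (left POS tag, correct word and tag, right POS tag, incorrect word) is a simplification chosen for this example, and the POS tags are supplied by hand rather than produced by a tagger.

```python
def apply_pattern(tagged, pattern):
    """Insert a known error wherever the pattern's correct side matches.

    tagged:  list of (word, pos) pairs for a correct sentence.
    pattern: (left_pos, (correct_word, correct_pos), right_pos, incorrect_word).
    Returns (words, labels) with "i" marking the inserted error.
    """
    left_pos, correct, right_pos, incorrect_word = pattern
    words = [w for w, _ in tagged]
    labels = ["c"] * len(tagged)
    for i in range(1, len(tagged) - 1):
        if (tagged[i - 1][1] == left_pos
                and tagged[i] == correct
                and tagged[i + 1][1] == right_pos):
            words[i] = incorrect_word
            labels[i] = "i"
    return words, labels

# Pattern from the slide, read right to left: VVD shopping_VVG II => VVD shop_VV0 II
pattern = ("VVD", ("shopping", "VVG"), "II", "shop")

# Hand-tagged example sentence; the tags are illustrative.
tagged = [("We", "PPIS2"), ("went", "VVD"), ("shopping", "VVG"),
          ("on", "II"), ("Sunday", "NPD1")]
words, labels = apply_pattern(tagged, pattern)
```

Because the pattern is conditioned on the surrounding POS tags, the error is only inserted in contexts similar to the one where a learner originally made it.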
  • 17. 17 Translation-based Error Generation
      Idea 3: Train a machine translation model to translate from correct to incorrect text
      Normally translate between languages: e.g. English to French
      Now let’s translate for generating errors: English to faulty English
      Can use off-the-shelf machine translation tools
      ORIG: We are a well-mixed class with equal numbers of boys and girls, all about 20 years old.
      PAT:  We are a well-mixed class with equal numbers of boys an girls, all about 20 year old.
      MT:   We are a well-mixed class with equals numbers of boys and girls, all about 20 years old.
      Marek Rei, Mariano Felice, Zheng Yuan and Ted Briscoe (2017) Artificial Error Generation with Machine Translation and Syntactic Patterns. BEA 2017.
  • 18. 18 Artificial Error Generation
      Training on 450K words of annotated data and 4.5M words of automatically generated data.
      System     FCE      CoNLL14-1   CoNLL14-2
      BiLSTM     41.10    16.40       23.90
      +PAT       47.81    19.47       28.49
      +MT        48.37    19.73       28.39
      +PAT+MT    49.11    21.87       30.13
  • 20. 20 Error Correction
      Error detection identifies incorrect words.
      Error correction modifies a sentence to remove errors.
      We can formulate correction as a machine translation problem: let’s translate from incorrect English to correct English. The system returns the highest-scoring possible translation.
      Input:  We can invite also people who are not members .
      Output: We can also invite people who are not members .
  • 21. 21 Statistical Machine Translation Text is separated into multi-word units (phrases) Phrase alignments and translation tables are learned from parallel datasets Language models are used to ensure reasonable output
  • 22. 22 Neural Machine Translation The encoder learns to process the source sentence and produce an informative vector representation The decoder learns to generate a sentence in a different language based on that vector Bahdanau et al. (2014), figure by Stephen Merity.
  • 23. 23 Input: I aren’t seen Albert since last summer . Output: I haven’t seen OOV since last summer . Handling Unknown Words Neural models have a limited fixed vocabulary and represent other words as OOV tokens. Solution: 1) Align the words between the input and output text 2) Translate OOV words in a post-processing step Zheng Yuan and Ted Briscoe (2016) Grammatical error correction using neural machine translation. NAACL 2016.
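The two-step solution can be sketched as a post-processing pass over the decoder output. The alignment is assumed to be given as a mapping from output positions to source positions, and the optional `lexicon` is a hypothetical word-level translation table; when a source word is not in it, the word is simply copied.

```python
OOV = "OOV"

def restore_oov(source, output, alignment, lexicon=None):
    """Replace OOV tokens in the output using aligned source words.

    source, output: token lists.
    alignment: dict mapping output index -> source index (assumed given).
    lexicon: optional dict of word-level corrections (hypothetical).
    """
    lexicon = lexicon or {}
    restored = []
    for i, tok in enumerate(output):
        if tok == OOV and i in alignment:
            src_word = source[alignment[i]]
            # Translate the source word if we know how, otherwise copy it.
            restored.append(lexicon.get(src_word, src_word))
        else:
            restored.append(tok)
    return restored

source = "I aren't seen Albert since last summer .".split()
output = "I haven't seen OOV since last summer .".split()
fixed = restore_oov(source, output, alignment={3: 3})
```

Copying works well for names such as "Albert"; a misspelled rare word would need the lexicon or a separate word-level model to be corrected rather than copied.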
  • 24. 24 Neural Machine Translation
      System       FCE      CoNLL14
      SMT          52.90    37.33
      NMT+align    53.49    39.90
  • 25. 25 N-best List
      Original sentence: There are some informations you have asked me about.
      SMT output:
      1st  There are some information you have asked me about.
      2nd  There is some information you have asked me about.
      3rd  There are some information you asked me about.
      4th  There are some information you have asked me.
      5th  There are some information you have asked me for.
  • 26. 26 Scoring Candidates
      The correction system may not know how to fix an error, and may therefore leave it uncorrected.
      How can we use the detection model to fix this problem and assign a better score to each “translation”?
      Token:      The   theatre   restaurant   was   closed   for   unknown   reason
      Detection:  +     +         +            +     +        +     -         -
  • 27. 27 Scoring Candidates
      How can we use the detection model to fix this problem and assign a better score to each “translation”?
      Token:       The   theatre   restaurant   was   closed   for   unknown   reason
      P(correct):  1.0   1.0       1.0          0.9   1.0      1.0   0.3       0.1
      1. Sentence correctness score: calculated based on the probability of each of its tokens being correct.
      2. Correction recall score: select the translation that has modified the (maximum number of) words marked by the detection model as incorrect.
      3. Correction agreement score: the ratio of agreed corrections compared to the disagreed corrections.
      Helen Yannakoudakis, Marek Rei, Øistein E. Andersen and Zheng Yuan (2017) Neural Sequence-Labelling Models for Grammatical Error Correction. EMNLP 2017.
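The first of these scores can be sketched as follows. The slide only says the score is based on per-token probabilities; summing their logarithms is one common way to combine them and is an assumption here, as are the probabilities attached to the second candidate.

```python
import math

def sentence_correctness(token_probs, eps=1e-12):
    """Sum of log P(correct) over tokens; higher means more likely correct.

    eps guards against log(0). Combining by log-sum is an assumption.
    """
    return sum(math.log(max(p, eps)) for p in token_probs)

def rerank(candidates):
    """Return the candidate sentence with the best correctness score.

    candidates: list of (sentence, per-token probabilities) pairs.
    """
    return max(candidates, key=lambda c: sentence_correctness(c[1]))[0]

# Probabilities for the uncorrected sentence, taken from the slide.
uncorrected = ("The theatre restaurant was closed for unknown reason",
               [1.0, 1.0, 1.0, 0.9, 1.0, 1.0, 0.3, 0.1])
# Hypothetical detection probabilities for a corrected candidate.
corrected = ("The theatre restaurant was closed for an unknown reason",
             [1.0, 1.0, 1.0, 0.9, 1.0, 1.0, 0.9, 0.9, 0.9])

best = rerank([uncorrected, corrected])
```

In a full reranker this value would be one feature alongside the translation model's own score, rather than the sole criterion.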
  • 28. 28 Neural Machine Translation
      System            FCE      CoNLL14
      SMT               52.90    37.33
      NMT+align         53.49    39.90
      Detect+correct    55.60    42.44
  • 29. 29 Error Correction Results
      Original sentence:    I work with children an the Computer help my Jop bat affeted to
      MT output:            I work with children and the Computer help my Jop bat affeted to
      MT+detection output:  I work with children and the computer helps my Jop bat affeted to
  • 30. 30 Error Correction Results
      Original sentence:    It takes 25 minutes that is convenient to us
      MT output:            It takes 25 minutes that is convenient for us
      MT+detection output:  It takes 25 minutes , which is convenient for us
  • 31. 31 Error Correction Results
      Original sentence:    I hope that our friend Richard Brown doesn’t have any serious willness
      MT output:            I hope that our friend Richard Brown doesn’t have any serious willness
      MT+detection output:  I hope that our friend Richard Brown doesn’t have any serious willingness
  • 33. 33 Essay Scoring Automatically assign a language proficiency score based on a freeform short essay.
  • 34. 34 Feature-based Essay Scoring Extract a number of features: ● Word sequences ○ Unigrams ○ Bigrams ○ Trigrams ● Part-of-speech tags ● Grammatical constructions ● Complexity measures ● Semantic similarity between sentences ● Estimated error count Helen Yannakoudakis, Ted Briscoe and Ben Medlock (2011) A New Dataset and Method for Automatically Grading ESOL Texts. ACL 2011.
  • 35. 35 Feature-based Essay Scoring
      Features              Spearman (ρ) %
      Word sequences        59.8
      + POS tags            68.7
      + Syntax structure    72.2
      + Error rate          78.5
      Human-human           79.2
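The table reports Spearman's rank correlation between predicted and human scores, shown as a percentage. A minimal pure-Python sketch of the measure, computed as the Pearson correlation of the ranks with ties given their average rank; the example scores are hypothetical.

```python
import math

def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of positions i..j, converted to 1-based
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation; assumes neither input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    return pearson(ranks(x), ranks(y))

# Hypothetical predicted and human essay scores.
predicted = [12.0, 25.5, 31.0, 18.0]
human = [10, 30, 35, 20]
rho = spearman(predicted, human)
```

Because only the ordering matters, a system can reach a high Spearman value even if its scores sit on a different scale from the examiners' marks.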
  • 36. 36 Neural Essay Scoring Bi-directional LSTM Convolutional Network
  • 37. 37 Score-specific Word Embeddings Optimising word embeddings to: 1) differentiate between correct and randomly corrupted sequences 2) predict the score of the essay where the current word sequence came from Then use these embeddings in a neural network for essay scoring. Dimitrios Alikaniotis, Helen Yannakoudakis and Marek Rei (2016) Automatic Text Scoring Using Neural Networks. ACL 2016.
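The two objectives can be combined into a single training loss. The sketch below assumes a weighted sum of a margin-based ranking loss (correct versus corrupted sequence) and a squared error on the essay score; the weighting parameter `alpha`, the margin of 1 and the function names are assumptions for illustration, not details confirmed by the slide.

```python
def ranking_loss(score_correct, score_corrupted, margin=1.0):
    """Hinge loss: the correct sequence should outscore the corrupted one."""
    return max(0.0, margin - score_correct + score_corrupted)

def score_loss(predicted_score, essay_score):
    """Squared error between predicted and actual essay score."""
    return (predicted_score - essay_score) ** 2

def combined_loss(score_correct, score_corrupted,
                  predicted_score, essay_score, alpha=0.5):
    """Weighted sum of the two objectives (alpha is a hypothetical weight)."""
    return (alpha * ranking_loss(score_correct, score_corrupted)
            + (1 - alpha) * score_loss(predicted_score, essay_score))

# Hypothetical network outputs for one word sequence.
loss = combined_loss(score_correct=2.0, score_corrupted=0.5,
                     predicted_score=3.0, essay_score=4.0)
```

Setting `alpha` to 1 would recover embeddings trained only on the corruption task, while lower values push words that signal essay quality apart in the embedding space.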
  • 38. 38 Score-specific Word Embeddings
      Evaluating score-specific word embeddings on the ASAP dataset: 13K marked essays (150-550 words each).
      Using a two-layer bi-directional LSTM for essay scoring.
      Pre-training    Spearman (ρ) %    RMSE
      None            68                7.31
      word2vec        79                3.2
      SSWE            91                2.4
  • 39. 39 Error-specific Word Embeddings Taking advantage of the available error annotation in the training data. Optimising embeddings to detect real errors, as opposed to randomly corrupted sequences. Network predicts the quality of each word sequence, based on the number of errors it contains. Youmna Farag, Marek Rei and Ted Briscoe (2017) An Error-Oriented Approach to Word Embedding Pre-Training. BEA 2017.
  • 40. 40 Error-specific Word Embeddings
      Evaluating error-specific word embeddings on the FCE dataset.
      Using the convolutional network for essay scoring.
      Pre-training    Spearman (ρ) %    RMSE
      word2vec        56.7              4.9
      GloVe           51.8              5.2
      SSWE            58.3              4.9
      ESWE            63.7              4.5
  • 42. 42 Future Directions Personalisation Generating exercises that are designed for a specific user Automated tutoring Active teaching from an automated dialogue system Speech Evaluating and providing feedback for spoken answers
  • 43. 43 Future Directions Specialised systems Supervised models targeting specific error types Multi-task learning Taking better advantage of other tasks and datasets Multi-modal topics Students writing about images or videos
  • 44. 44 Summary
      01 Error detection: Neural sequence labelling architecture; Artificial data generation
      02 Error correction: Neural machine translation; Reranking with detection
      03 Essay scoring: Feature-based model; Neural essay scoring; Score-specific word embeddings