SlideShare a Scribd company logo
1 of 31
Download to read offline
The Allen AI Science
Challenge &
DeepHack.Q&A
St. Petersburg Data Science Meetup #6, Feb 19th, 2016
Q: When athletes begin to exercise, their heart rates and respiration rates
increase. At what level of organization does the human body coordinate these
functions?
A. at the tissue level
B. at the organ level
C. at the system level
D. at the cellular level
Wed 7 Oct 2015 – Sat 13 Feb 2016
Stage 1: 800 teams (>1000 participants),
Stage 2: 170 teams
https://www.kaggle.com/c/the-allen-ai-science-challenge
2700 questions - train set
8132 questions - validation set
21298 questions - final test set
DeepHack Q&A qa.deephack.me/
Qualification round: Top-50 participants with the
highest scores
Rough competition: Kaggle Top-40 to get to the Top-50 o_O
Winter ML school + hackathon: 31st, Jan - 5th Feb, 2016
GP team created at Jan, 31st from the four teams
The final 30 minutes of the hackathon: https:
//www.youtube.com/watch?v=tCKL5vbiHuo
Pavel Kalaidin (VK)
Marat Zainutdinov (Quantbrothers)
Roman Trusov (ITMO University)
Artyom Korkhov (Zvooq)
Igor Shilov (Zvooq)
Timur Luguev (Clevapi)
Ilyas Luguev (Clevapi)
Team Generation Gap
DeepHack: 1st, ~0.556
Allen AI: 7th, 0.55059
Datasets
ck12.org
wikipedia.org (science subset)
flashcards: studystack.com,
quizlet.com
Topic at the forum: https://www.kaggle.com/c/the-allen-ai-science-challenge/forums/t/16877/external-data-
repository
Hail to Lucene
Lucene
Question
a) ans1
b) ans2
c) ans3
d) ans4
Question ans1
Question ans4
Question ans3
Question ans2
0.5
0.4
0.02
0.01
...
0.5
0.4
0.02
0.01
...
0.5
0.4
0.02
0.01
...
0.5
0.4
0.02
0.01
...
Wiki ck12
quizlets
Stemming,stopwords
Custom queries rule
Lucene scores: https://lucene.apache.
org/core/3_5_0/api/core/org/apache/lucene/search/Similarity.html
AdaGram (a.k.a Reptil)
Breaking Sticks and Ambiguities with Adaptive Skip-gram: http:
//arxiv.org/abs/1502.07257
Reference implementation in Julia: https://github.
com/sbos/AdaGram.jl
reptil art cultur final play
signific role folklor
religion popular cultur moch
peopl noun coldblood anim
scale general move stomach
short leg exampl snake lizard
turtl noun aw person
Model trained like this: sh train.sh --min-freq 20 --window 5 --workers 40 --epochs 5
--dim 300 --alpha 0.1 corpus.txt adam.dict adam.model
Number of prototypes is 5 by default.
AdaGram (a.k.a Reptil) approach
[0.42, 0.55, 0.08, …]
N-grams PMI
x, y - Ngrams
Example 1-gram -> 1-gram
unit -> state
magnet -> field
carbon -> dioxid
million -> year
year -> ago
amino -> acid
Example 1-gram -> 3-gram
around -> million year ago
period -> million year ago
forc -> van der waal
fossil -> million year ago
nobel -> prize physiolog medicin
date -> million year ago
mercuri -> venus earth mar
N-grams PMI
greatest contributor air pollut unit state
What is the greatest contributor to air pollution
in the United States?
greatest
contributor
air
...
greatest contributor
contributor air
air pollut
...
1-grams 2-grams 3-grams
Power plants
power plant
power
plant
1-grams
power plant
2-grams
...
Scores
Fail Story
TL;DR wasted tons of time, got ~0.3 in
almost all approaches
LSA + Lucene
Corpus
LSA
TI_1
TI_2
TI_n
Lucene
qa pair 1
qa pair 2
qa pair 3
qa pair 4
Queries in
topic
indices
Result: for each qa pair,
max(s1...sn)
Gave 1% improvement over
basic Lucene; but took
EXTREMELY long time to
process :(
Syntax co-occurrence
nobel chemistry prize 517
national science academy 445
long time period 340
also role play 306
nobel physic prize 279
national medical library 273
carbon water dioxide 261
second thermodynamics law 247
speed sound of_pobj
density population compound
take place dobj
link external compound 0.3 :(
word2vec combinations
Wanted to capture the intersection of meanings, but didn’t know
how to combine word2vec representations
TFIDFqa pairs
Combinations of
question tokens
Combinations of
answer tokens
Cosine
similarity
Max score ~ 0.3 :( even with careful kw filtering
word2gauss didn’t help too
Averaging Neural networks (1st encounter)
w2v_dim = 300
vec_q = mean(w2v(Q))
vec_c = mean(w2v(Ac))
vec_w = mean(w2v(Aw))
cos_sim(vec_q, vec_c) > cos_sim(vec_q, vec_w)
Averaging Neural networks (1st encounter)
w2v_dim = 300
vec_q = mean(w2v(Q))
vec_c = mean(w2v(Ac))
vec_w = mean(w2v(Aw))
a = CNN(w2v(X))
vec_x = mean(w2v(X) * a)
cos_sim(vec_q, vec_c) > cos_sim(vec_q, vec_w)
Semantic Neural networks (2nd encounter)
+ Paragraphs
LSTM = LSTM(w2v)
LSTM(s1 | s2) > LSTM(s1 | s3) if s1 and s2 are from the same
paragraph, while s1 and s3 are not
LSTM(a, b) is low then a and b are from the same paragraph (energy
based learning)
Loss = max(0, M - LSTM(s1, s2) + LSTM(s1, s3))
Score: 0.26
Siamese architecture
Hinge Loss
Margin
Reading Neural networks (3rd encounter)
+ Lots of paragraphs
+ Search Engine
+ A survey:
- bigrams are not accounted
- main idea (keywords) of a sentence is not recognized
Reading Neural networks (3rd encounter)
+ Lots of paragraphs
+ Search Engine
+ A survey:
- bigrams are not accounted
- main idea (keywords) of a sentence is not recognized
Reading Neural networks (3rd encounter)
All we want is to know if a sentence is from a paragraph to be
able to rerank lucene scores.
Hinge Loss
Margin
LSTM(P)
LSTM(s1)
LSTM(s2)
Reading Neural networks (3rd encounter)
sentences -> LSTM -> Dense NN -> Embedding
w2v -> LSTM -> Dense NN -> Embedding
w2v -> Mean -> Dense NN -> Embedding
Neural networks. Learned lessons.
Start as small as possible
Corruption is important for siamese networks
Learning curve is misleading in NLP
Lessons learned
Start early - wasted two first months of the competition (but had a week
of 24/7 hackathon at the end)
No stickers in the team channel (except with Yann LeCun on a good submit)
Common toolbox is nice
A dedicated server is a good thing to have (no need in AWS spot instances)
Experiment fast, fail early
Team work means a lot

More Related Content

Similar to The Allen AI Science Challenge

Machine Learning
Machine LearningMachine Learning
Machine Learning
butest
 
Lecture 2 data structures & algorithms - sorting techniques
Lecture 2  data structures & algorithms - sorting techniquesLecture 2  data structures & algorithms - sorting techniques
Lecture 2 data structures & algorithms - sorting techniques
Dharmendra Prasad
 
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
butest
 
Stack squeues lists
Stack squeues listsStack squeues lists
Stack squeues lists
James Wong
 
Stacksqueueslists
StacksqueueslistsStacksqueueslists
Stacksqueueslists
Fraboni Ec
 
Stacks queues lists
Stacks queues listsStacks queues lists
Stacks queues lists
Tony Nguyen
 

Similar to The Allen AI Science Challenge (20)

Introduction to Algorithms
Introduction to AlgorithmsIntroduction to Algorithms
Introduction to Algorithms
 
2.03.Asymptotic_analysis.pptx
2.03.Asymptotic_analysis.pptx2.03.Asymptotic_analysis.pptx
2.03.Asymptotic_analysis.pptx
 
String kmp
String kmpString kmp
String kmp
 
Computing k-rank Answers with Ontological CP-nets
Computing k-rank Answers with Ontological CP-netsComputing k-rank Answers with Ontological CP-nets
Computing k-rank Answers with Ontological CP-nets
 
A Network-Aware Approach for Searching As-You-Type in Social Media
A Network-Aware Approach for Searching As-You-Type in Social MediaA Network-Aware Approach for Searching As-You-Type in Social Media
A Network-Aware Approach for Searching As-You-Type in Social Media
 
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMYComputer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Lecture 2 data structures & algorithms - sorting techniques
Lecture 2  data structures & algorithms - sorting techniquesLecture 2  data structures & algorithms - sorting techniques
Lecture 2 data structures & algorithms - sorting techniques
 
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
Software-Praktikum SoSe 2005 Lehrstuhl fuer Maschinelles ...
 
Stack squeues lists
Stack squeues listsStack squeues lists
Stack squeues lists
 
Stacksqueueslists
StacksqueueslistsStacksqueueslists
Stacksqueueslists
 
Stacks queues lists
Stacks queues listsStacks queues lists
Stacks queues lists
 
Stacks queues lists
Stacks queues listsStacks queues lists
Stacks queues lists
 
Stacks queues lists
Stacks queues listsStacks queues lists
Stacks queues lists
 
Stacks queues lists
Stacks queues listsStacks queues lists
Stacks queues lists
 
algo1-aa-gist_typed.pdf
algo1-aa-gist_typed.pdfalgo1-aa-gist_typed.pdf
algo1-aa-gist_typed.pdf
 
time_complexity_list_02_04_2024_22_pages.pdf
time_complexity_list_02_04_2024_22_pages.pdftime_complexity_list_02_04_2024_22_pages.pdf
time_complexity_list_02_04_2024_22_pages.pdf
 
Introduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdfIntroduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdf
 
Lecture38
Lecture38Lecture38
Lecture38
 
NIPS2017 Few-shot Learning and Graph Convolution
NIPS2017 Few-shot Learning and Graph ConvolutionNIPS2017 Few-shot Learning and Graph Convolution
NIPS2017 Few-shot Learning and Graph Convolution
 

Recently uploaded

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 

Recently uploaded (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 

The Allen AI Science Challenge

  • 1. The Allen AI Science Challenge & DeepHack.Q&A St. Petersburg Data Science Meetup #6, Feb 19th, 2016
  • 2. Q: When athletes begin to exercise, their heart rates and respiration rates increase. At what level of organization does the human body coordinate these functions? A. at the tissue level B. at the organ level C. at the system level D. at the cellular level Wed 7 Oct 2015 – Sat 13 Feb 2016 Stage 1: 800 teams (>1000 participants), Stage 2: 170 teams https://www.kaggle.com/c/the-allen-ai-science-challenge 2700 questions - train set 8132 questions - validation set 21298 questions - final test set
  • 3. DeepHack Q&A qa.deephack.me/ Qualification round: Top-50 participants with the highest scores Rough competition: Kaggle Top-40 to get to the Top-50 o_O Winter ML school + hackathon: 31st, Jan - 5th Feb, 2016 GP team created at Jan, 31st from the four teams The final 30 minutes of the hackathon: https: //www.youtube.com/watch?v=tCKL5vbiHuo
  • 4. Pavel Kalaidin (VK) Marat Zainutdinov (Quantbrothers) Roman Trusov (ITMO University) Artyom Korkhov (Zvooq) Igor Shilov (Zvooq) Timur Luguev (Clevapi) Ilyas Luguev (Clevapi) Team Generation Gap DeepHack: 1st, ~0.556 Allen AI: 7th, 0.55059
  • 5. Datasets ck12.org wikipedia.org (science subset) flashcards: studystack.com, quizlet.com Topic at the forum: https://www.kaggle.com/c/the-allen-ai-science-challenge/forums/t/16877/external-data- repository
  • 6. Hail to Lucene Lucene Question a) ans1 b) ans2 c) ans3 d) ans4 Question ans1 Question ans4 Question ans3 Question ans2 0.5 0.4 0.02 0.01 ... 0.5 0.4 0.02 0.01 ... 0.5 0.4 0.02 0.01 ... 0.5 0.4 0.02 0.01 ... Wiki ck12 quizlets Stemming,stopwords
  • 7. Custom queries rule Lucene scores: https://lucene.apache. org/core/3_5_0/api/core/org/apache/lucene/search/Similarity.html
  • 8. AdaGram (a.k.a Reptil) Breaking Sticks and Ambiguities with Adaptive Skip-gram: http: //arxiv.org/abs/1502.07257 Reference implementation in Julia: https://github. com/sbos/AdaGram.jl
  • 9. reptil art cultur final play signific role folklor religion popular cultur moch peopl noun coldblood anim scale general move stomach short leg exampl snake lizard turtl noun aw person
  • 10. Model trained like this: sh train.sh --min-freq 20 --window 5 --workers 40 --epochs 5 --dim 300 --alpha 0.1 corpus.txt adam.dict adam.model Number of prototypes is 5 by default.
  • 11. AdaGram (a.k.a Reptil) approach [0.42, 0.55, 0.08, …]
  • 12. N-grams PMI x, y - Ngrams Example 1-gram -> 1-gram unit -> state magnet -> field carbon -> dioxid million -> year year -> ago amino -> acid Example 1-gram -> 3-gram around -> million year ago period -> million year ago forc -> van der waal fossil -> million year ago nobel -> prize physiolog medicin date -> million year ago mercuri -> venus earth mar
  • 13. N-grams PMI greatest contributor air pollut unit state What is the greatest contributor to air pollution in the United States? greatest contributor air ... greatest contributor contributor air air pollut ... 1-grams 2-grams 3-grams Power plants power plant power plant 1-grams power plant 2-grams ...
  • 15. Fail Story TL;DR wasted tons of time, got ~0.3 in almost all approaches
  • 16. LSA + Lucene Corpus LSA TI_1 TI_2 TI_n Lucene qa pair 1 qa pair 2 qa pair 3 qa pair 4 Queries in topic indices Result: for each qa pair, max(s1...sn) Gave 1% improvement over basic Lucene; but took EXTREMELY long time to process :(
  • 17. Syntax co-occurrence nobel chemistry prize 517 national science academy 445 long time period 340 also role play 306 nobel physic prize 279 national medical library 273 carbon water dioxide 261 second thermodynamics law 247 speed sound of_pobj density population compound take place dobj link external compound 0.3 :(
  • 18. word2vec combinations Wanted to capture the intersection of meanings, but didn’t know how to combine word2vec representations TFIDFqa pairs Combinations of question tokens Combinations of answer tokens Cosine similarity Max score ~ 0.3 :( even with careful kw filtering word2gauss didn’t help too
  • 19. Averaging Neural networks (1st encounter) w2v_dim = 300 vec_q = mean(w2v(Q)) vec_c = mean(w2v(Ac)) vec_w = mean(w2v(Aw)) cos_sim(vec_q, vec_c) > cos_sim(vec_q, vec_w)
  • 20. Averaging Neural networks (1st encounter) w2v_dim = 300 vec_q = mean(w2v(Q)) vec_c = mean(w2v(Ac)) vec_w = mean(w2v(Aw)) a = CNN(w2v(X)) vec_x = mean(w2v(X) * a) cos_sim(vec_q, vec_c) > cos_sim(vec_q, vec_w)
  • 21. Semantic Neural networks (2nd encounter) + Paragraphs LSTM = LSTM(w2v) LSTM(s1 | s2) > LSTM(s1 | s3) if s1 and s2 are from the same paragraph, while s1 and s3 are not LSTM(a, b) is low then a and b are from the same paragraph (energy based learning) Loss = max(0, M - LSTM(s1, s2) + LSTM(s1, s3)) Score: 0.26
  • 24. Reading Neural networks (3rd encounter) + Lots of paragraphs + Search Engine + A survey: - bigrams are not accounted - main idea (keywords) of a sentence is not recognized
  • 25. Reading Neural networks (3rd encounter) + Lots of paragraphs + Search Engine + A survey: - bigrams are not accounted - main idea (keywords) of a sentence is not recognized
  • 26. Reading Neural networks (3rd encounter) All we want is to know if a sentence is from a paragraph to be able to rerank lucene scores.
  • 27.
  • 29. Reading Neural networks (3rd encounter) sentences -> LSTM -> Dense NN -> Embedding w2v -> LSTM -> Dense NN -> Embedding w2v -> Mean -> Dense NN -> Embedding
  • 30. Neural networks. Learned lessons. Start as small as possible Corruption is important for siamese networks Learning curve is misleading in NLP
  • 31. Lessons learned Start early - wasted two first months of the competition (but had a week of 24/7 hackathon at the end) No stickers in the team channel (except with Yann LeCun on a good submit) Common toolbox is nice A dedicated server is a good thing to have (no need in AWS spot instances) Experiment fast, fail early Team work means a lot