More Related Content
Similar to Pennington, Socher, and Manning. (2014) GloVe: Global vectors for word representation (20)
Pennington, Socher, and Manning. (2014) GloVe: Global vectors for word representation
- 1. GLOVE: GLOBAL VECTORS
FOR WORD REPRESENTATION
GloVe: Global Vectors for Word Representation 1
Jeffrey Pennington, Richard Socher,
Christopher D. Manning
EMNLP 2014, pages 1532â1543.
èªã¿æ: å²¡åŽ çŽèŠ³
ïŒP3ãé€ãïŒã¹ã©ã€ãäžã®è¡šã»å³ã¯ãã¹ãŠå
è«æã®åŒçšïŒ
Pennington+ (2014)
- 3. æå°äºä¹æ³ã«ããåèªãã¯ãã«ã®åŠç¿
Pennington+ (2014) GloVe: Global Vectors for Word Representation 3
ðœ =
ð,ð=1
ð
ð(ðð,ð) (ðð
ð
ðð + ðð + ðð â log ðð,ð)2
ç®çé¢æ°:
ð ð¥ =
(ð¥/ð¥max) ðŒ (if ð¥ < ð¥max)
1 (otherwise)
åèªðãšåèªðã®å
±èµ·é »åºŠåèªã®ç·æ°
åèªðã®ãã¯ãã«
åèªðã®ãã¯ãã«â
åèªðã®ãã€ã¢ã¹é
åèªðã®ãã€ã¢ã¹é
â
1系統
2系統
â»ååèªã«å¯ŸããŠãã©ã¡ã¿ã2系統ããã®ã¯
word2vecãšåæ§ïŒæ¬ç 究ã¯åèªðã®ãã¯ãã«ã
æçµçã«(ðð + ðð)ãšããïŒç²ŸåºŠãåäžãããããïŒ
ð¥ ððð¥ = 100, α = 0.75 ã®å Žå â
AdaGrad
(SGD)ã§åŠç¿
- 4. ðð
ð
ðð + ðð + ðð â log ðð,ð ã®çç±(1/4)
⢠åèªðãšåèªðã®ããåŽé¢ïŒaspectïŒã«ãããé¢ä¿
ãïŒæèåèªðã§è¡šãããšãèãã
⢠äŸ: ãç±ååŠãã«ãããiceãšsteam
⢠ðð,ð = ð(ð|ð)ãããðð,ð/ðð,ðã®æ¹ãåèªðãšåèªðã®
ç¹åŸŽãæããæèãšããŠæçšãã
⢠äŸ: waterãfashionãããsolidãgasã®æ¹ãæçš
Pennington+ (2014) GloVe: Global Vectors for Word Representation 4
- 5. ðð
ð
ðð + ðð + ðð â log ðð,ð ã®çç±(2/4)
⢠åèªð, ð, ðã®ãã¯ãã«ãããããðð, ðð, ð ðãšãã
⢠åèªãã¯ãã«ã§ðð,ð/ðð,ðãè¡šçŸããã«ã¯ïŒ
ð¹ ðð â ðð, ð ð = ðð,ð/ðð,ð
⢠巊蟺ã®åŒæ°ïŒãã¯ãã«ïŒãšå³èŸºïŒã¹ã«ã©ãŒïŒã®å
ãåãããããã®æãã·ã³ãã«ãªæ¹æ³ã¯ïŒ
ð¹ ðð â ðð
ð
ð ð = ðð,ð/ðð,ð
Pennington+ (2014) GloVe: Global Vectors for Word Representation 5
åèªðãšåèªðã®ç¹åŸŽã®å¯Ÿæ¯ããã¯ãã«ã®
å·®ã§è¡šçŸïŒå æ³æ§ææ§ãäºãèæ
®ïŒ
é¢æ°ã®åœ¢ã¯
ããããçµã蟌ã ð ðãšã¯å¥ç³»çµ±ã®ãã¯ãã«
å
ç©ããšã£ãŠã¹ã«ã©ãŒå€ã«ãã
- 6. ðð
ð
ðð + ðð + ðð â log ðð,ð ã®çç±(3/4)
⢠åèªãšæèã®åœ¹å²ã¯å
¥ãæ¿ããå¯èœã§ããã¹ã
⢠ðð â ððãšð â ð ðã®å
¥ãæ¿ããåæã«èæ
®ãã¹ã
⢠ð¹ãšããŠå æ³çŸ€ããä¹æ³çŸ€ãžã®æºåååå
exp: â â â+ãæ¡çšãã
exp ðð â ðð
ð
ð ð =
exp ðð
ð
ð ð
exp ðð
ð
ð ð
=
ðð,ð
ðð,ð
⢠ãããã£ãŠïŒ
exp ðð
ð
ð ð = ðð,ð = ðð,ð/ðð
⢠䞡蟺ã®å¯Ÿæ°ããšããšïŒ
ðð
ð
ð ð = log ðð,ð â log ðð
Pennington+ (2014) GloVe: Global Vectors for Word Representation 6
- 7. ðð
ð
ðð + ðð + ðð â log ðð,ð ã®çç±(4/4)
⢠ãŸã åèªãšæèã®å
¥ãæ¿ããã§ããªã
ðð
ð
ð ð = log ðð,ð â log ðð
⢠ðã«é¢ããå®æ°é
ããªããã
⢠log ððããã€ã¢ã¹é
ððã§è¡šãïŒæ°ãã«ðã«é¢
ãããã€ã¢ã¹é
ð ðãå°å
¥
ðð
ð
ð ð = log ðð,ð â ðð â ð ð
ðð
ð
ð ð + ðð + ð ð = log ðð,ð
Pennington+ (2014) GloVe: Global Vectors for Word Representation 7
- 8. ð(ðð,ð)ã§éã¿ä»ãããçç±
⢠ðð,ð = 0ã®ãšãlog ðð,ðãèšç®ã§ããªã
⢠å
±èµ·è¡åðã®èŠçŽ ã¯ã»ãšãã©ã0ïŒçè¡åïŒ
⢠圱é¿åã0ãšãã
⢠äœé »åºŠã®å
±èµ·é »åºŠã¯éèŠããªã
⢠äœé »åºŠãªå
±èµ·äºè±¡ããã¯ãã«ã§ç¬Šå·åããã®ã¯å€§å€
⢠ðð,ð < ð¥maxãªãã°åœ±é¿åã(ðð,ð/ð¥max) ðŒ
ãšãã
⢠é«é »åºŠã®å
±èµ·èŠçŽ ãéèŠããããªã
⢠é«é »åºŠãªå
±èµ·äºè±¡ã¯å¹³çã«ãã¯ãã«ã§ç¬Šå·å
⢠ðð,ð ⥠ð¥maxãªãã°åœ±é¿åã1ãšãã
Pennington+ (2014) GloVe: Global Vectors for Word Representation 8
- 9. Skip-gramãivLBLãšã®é¢ä¿ (1/2)
⢠ç®çé¢æ°ã¯ïŒ
ðœ = â
ðâððððð¢ð ,
ðâðððð¡ðð¥ð¡ ð
log ðð,ð ïŒðð,ð =
exp ðð
ð
ðð
ð=1
ð
exp ðð
ð
ð ð
⢠繰ãè¿ãåºçŸããåèªã»æèãã¢ããŸãšãããšïŒ
ðœ = â
ð=1
ð
ð=1
ð
ðð,ð log ðð,ð
⢠ðð,ð = ðð ðð,ðã§ããããïŒ
ðœ = â
ð=1
ð
ðð
ð=1
ð
ðð,ð log ðð,ð =
ð=1
ð
ðð ð»(ðð, ðð)
Pennington+ (2014) GloVe: Global Vectors for Word Representation 9
ã¯ãã¹ãšã³ããããŒ
- 10. Skip-gramãivLBLãšã®é¢ä¿ (2/2)
⢠Skip-gramãivLBLã¯ç¢ºçååžððãšððã®ã¯ãã¹ãšã³
ããããŒãæå°åããŠãã
⢠ã¯ãã¹ãšã³ããããŒã¯ãã³ã°ããŒã«ãªååžã«åããªã
ïŒäœé »åºŠãªäºè±¡ãèæ
®ããããïŒ
⢠ððã¯ç¢ºçååžãšããŠæ£èŠåãããã¹ã
⢠å®éã«ã¯ððã®åæ¯ã®èšç®ã倧å€ãªã®ã§æ£èŠåãããªãïŒè¿äŒŒïŒ
⢠ææ¡ææ³: äºä¹èª€å·®ã§ç¢ºçååžã®è·é¢ãèšç®
ðœ =
ð,ð=1
ð
ðð ðð,ð â ðð,ð
2
, ðð,ð = ðð,ð, ðð,ð = exp ðð
ð
ðð
⢠å®æ
ã¯å¯Ÿæ°ã®äºä¹èª€å·®: ðð,ð = log ðð,ð , ðð,ð = ðð
ð
ðð
Pennington+ (2014) GloVe: Global Vectors for Word Representation 10
- 11. è©äŸ¡ããŒã¿
⢠Word analogy (Mikolov+ 13)
⢠âa is to b as c is to d?â
⢠(a, b, c, d) = (Athens, Greece, Berlin, Germany)
⢠d: ð ð â ð ð + ð ðãšã®ã³ãµã€ã³é¡äŒŒåºŠãæãé«ãåèª
⢠Word similarity
⢠WordSim-353 (Finkelstein+ 01), MC (Miller+ 91), RG
(Rubenstein+ 65), SCWS (Huang+ 12), RW (Luong+ 13)
⢠åºæè¡šçŸæœåºïŒCoNLL-2003, ACE, MUC7ïŒ
⢠CRFã®çŽ æ§ã«50次å
ã®åèªãã¯ãã«ã®å€ãè¿œå
Pennington+ (2014) GloVe: Global Vectors for Word Representation 11
- 12. å®éšèšå®
⢠èšç·ŽããŒã¿ïŒããŒã¯ã³æ°ïŒ
⢠1.0B: 2010 Wikipedia dump
⢠1.6B: 2014 Wikipedia dump
⢠4.3B: Gigaword 5
⢠6.0B: Gigaword 5 + 2014 Wikipedia dump
⢠42B: WebææžïŒCommon CrawlïŒ
⢠åèª-æèå
±èµ·è¡åã®æ§ç¯
⢠Stanford tokenizer, å°æååã®åŸïŒé«é »åºŠãª400,000åèªãæ¡çš
⢠åèªã®å·ŠåŽã®10åèªãšå³åŽã®10åèªãæèèªãšãã
⢠åèªãšæèèªã®è·é¢ðã«å¿ããŠåºçŸé »åºŠã1/ðãšãã
⢠åŠç¿æã®ãã©ã¡ãŒã¿
⢠ð¥max = 100, α = 0.75, AdaGradã®åæåŠç¿ç0.05
⢠å埩åæ°ã¯50åïŒ300次å
æªæºã®å ŽåïŒãããã¯100å
⢠(ðð + ðð)ãåèªãã¯ãã«ãšãã
⢠æ¬æ¥ïŒå
±èµ·è¡åðã察称è¡åãªãððãš ððã¯ç䟡ã«ãªãã¯ã
⢠è€æ°ã®åŠç¿çµæãçµ±åããããšã§ãã€ãºèæ§ãåäžãããšæåŸ
⢠(ðð + ðð)ã«ããæ§èœã®åäžã¯å
ãã ãïŒword analogyã§ã¯åçãªåäž
Pennington+ (2014) GloVe: Global Vectors for Word Representation 12
- 13. ããŒã¹ã©ã€ã³ææ³
⢠Skip-gram (SG), Continuous BOW (CBOW)
⢠word2vecã®å®è£
ãå©çš
⢠åŠç¿å¯Ÿè±¡ã¯400,000åèªïŒæèå¹
10åèªïŒ10åèªãè² äŸãšã
ãŠãµã³ãã«
⢠SVD
⢠åŠç¿å¯Ÿè±¡ã10,000åèªãŸã§çµã蟌ã
⢠SVD: ðð,ð
⢠SVD-S: ðð,ð
⢠SVD-L: log(1 + ðð,ð)
⢠(i)vLBLã®çµæã¯è«æïŒMnih+ 13ïŒãã
⢠HPCAã¯å
¬éãããŠããåèªãã¯ãã«ãçšãã
Pennington+ (2014) GloVe: Global Vectors for Word Representation 13
- 14. Word analogyã¿ã¹ã¯ã®ç²ŸåºŠ
⢠GloVeã®å§å
⢠ããŒã¿éãå¢ããããšã§ç²Ÿ
床ãåäžããŠãã
⢠word2vecãä»ã®è«æã§
å ±åãããŠããæ°å€ãã
ãè¯ãã£ã
⢠äžå©ãªãã©ã¡ãŒã¿ãéžãã
èš³ã§ã¯ãªã
⢠SVDã¯ããŒã¿éãå¢ãã
ãŠã粟床ãäžãããªã
Pennington+ (2014) GloVe: Global Vectors for Word Representation 14