spaCy lightning talk for KyivPy #21

spaCy to the rescue
or why NLTK is not cool anymore
Anton Kasyanov | DataRobot

What is spaCy
• Natural language processing library

• Industrial strength - based on latest research

• Fast - written using Cython

Usage
import spacy
nlp = spacy.load(‘en')
doc = nlp(
‘Hello, world.’
‘Here are two sentences.’
)

Tokeniser
token = doc[0]
sentence = next(doc.sents)
assert token is sentence[0]
assert sentence.text ==
‘Hello, world.'

Word Vectors
doc = nlp(“Apples and oranges are similar.
Boots and hippos aren’t.")
apples = doc[0]
oranges = doc[2]
boots = doc[6]
hippos = doc[8]
assert apples.similarity(oranges) >
boots.similarity(hippos)

Syntactic Parser

Speed

Other features
• Part-of-Speech tagger

• Named entities recognition

• Integer IDs for words

• Multi-threading support

• Deep learning

• German, English, French (so far)

Thanks!
https://spacy.io
antonkasyanov.com

spaCy lightning talk for KyivPy #21

More Related Content

What's hot

More from Anton Kasyanov

Recently uploaded

spaCy lightning talk for KyivPy #21