Embed presentation
Download as PDF, PPTX



![Anton Kasyanov | DataRobot
Tokeniser
token = doc[0]
sentence = next(doc.sents)
assert token is sentence[0]
assert sentence.text ==
‘Hello, world.'](https://image.slidesharecdn.com/spacykyivpy-170812103228/85/spaCy-lightning-talk-for-KyivPy-21-4-320.jpg)
![Anton Kasyanov | DataRobot
Word Vectors
doc = nlp(“Apples and oranges are similar.
Boots and hippos aren’t.")
apples = doc[0]
oranges = doc[2]
boots = doc[6]
hippos = doc[8]
assert apples.similarity(oranges) >
boots.similarity(hippos)](https://image.slidesharecdn.com/spacykyivpy-170812103228/85/spaCy-lightning-talk-for-KyivPy-21-5-320.jpg)




Spacy is introduced as a natural language processing library that is faster and more feature-rich than the popular but older NLTK library. It can tokenize text, identify parts of speech, recognize named entities, and perform syntactic parsing. Spacy uses word vectors and deep learning for tasks like measuring word similarity and is industrial strength, written in Cython for speed and supporting multiple languages so far.



![Anton Kasyanov | DataRobot
Tokeniser
token = doc[0]
sentence = next(doc.sents)
assert token is sentence[0]
assert sentence.text ==
‘Hello, world.'](https://image.slidesharecdn.com/spacykyivpy-170812103228/85/spaCy-lightning-talk-for-KyivPy-21-4-320.jpg)
![Anton Kasyanov | DataRobot
Word Vectors
doc = nlp(“Apples and oranges are similar.
Boots and hippos aren’t.")
apples = doc[0]
oranges = doc[2]
boots = doc[6]
hippos = doc[8]
assert apples.similarity(oranges) >
boots.similarity(hippos)](https://image.slidesharecdn.com/spacykyivpy-170812103228/85/spaCy-lightning-talk-for-KyivPy-21-5-320.jpg)



