The impulse to do this came from what Anna did on Russian word processor manuals in a class I taught.
Jan made the Czech National Corpus, Prague Dependency Treebank. He is Eva’s son. Eva has, inter alia, worked for the Spielberg Foundation on collecting and indexing the memories of Shoah survivors No large annotated corpora of Russian exist in the public domain. But large annotated corpora of Czech do Largely thanks to Jan Hajic and predecessors in Prague.
Roman Jakobson was alleged to speak 9 languages, all of the Russian.
A model of the way parts of speech succeed each other and the way words come out of parts of speech.
A model of the way parts of speech succeed each other and the way words come out of parts of speech. In Russian. Of course, this could mean “The red book”.
A model of the way parts of speech succeed each other and the way words come out of parts of speech. In Russian. Of course, this could mean “The red book”.
A model of the way parts of speech succeed each other and the way words come out of parts of speech. In Russian. Of course, this could mean “The red book”.
Russian and Czech are sufficiently similar that the tags can be the same (a mild lie). But the words are often different.
Jiri Hana helped us make a morphological analyser for Russian. Doesn’t choose which possibility is right, just lays out the possibilities. We assign probabilities, but they are not very accurate, and they don’t need to be, because the other probabilities do the work.