tagger
tagger [2019-06-21 13:32:06] (current): created, external edit 127.0.0.1

====== Tagger and lemmatizer HOWTO ======

===== Installation =====

<
> git clone https://
> cd src/
> vim Makefile.builtem
- C_FLAGS += -std=c++11 -W -Wall -mtune=generic -msse -msse2 -mfpmath=sse -fvisibility=hidden -U_FORTIFY_SOURCE
+ C_FLAGS += -std=c++11 -W -Wall -march=native -fvisibility=hidden -U_FORTIFY_SOURCE
> make
</

===== Models =====

Download and unzip a model:

Czech: https://

English: https://

(The download link is at the bottom of the page.)

Beware: the models may have a non-free license.

===== Run tagger =====

<
| ./

===== Run lemmatizer =====

<
| ./
czech-morfflex-pdt-131112-pos_only-raw_lemmas.tagger 2>/
| cut -f 2 | tr "
</

===== Problems =====

Loading large models takes several seconds, but tagging itself is very fast. The new version includes a REST server, so it can be started once and then handle multiple requests.
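
The compiler-flag change in the Installation section can probably also be applied non-interactively instead of opening the file in vim. The following is a minimal sketch, assuming that the ''C_FLAGS'' line in ''Makefile.builtem'' contains the flag string exactly as shown in the HOWTO. The helper name ''use_native_flags'' is made up for illustration and is not part of the tagger sources.

```shell
#!/bin/sh
# Hypothetical helper: swap the generic SSE tuning flags for -march=native.
# Assumes the flag string appears verbatim, as quoted in the HOWTO.
use_native_flags() {
    makefile="$1"
    tmp="$(mktemp)" || return 1
    # Write the edited copy to a temp file, then copy it back so the
    # original file keeps its permissions.
    sed 's/-mtune=generic -msse -msse2 -mfpmath=sse/-march=native/' "$makefile" > "$tmp" \
        && cat "$tmp" > "$makefile"
    status=$?
    rm -f "$tmp"
    return $status
}

# Usage (from inside src/): use_native_flags Makefile.builtem
```

Note that ''-march=native'' targets the CPU of the build machine, so the resulting binary may not run on older or different processors.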
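
The ''cut -f 2'' step in the lemmatizer pipeline suggests that the tool emits tab-separated lines with the lemma in the second column. The sketch below illustrates that post-processing step under this assumption, using made-up sample lines rather than real tagger output. The ''tr'' argument is truncated in the source, so joining the lemmas with spaces is only a guess at one plausible use, and the helper name ''lemmas_only'' is hypothetical.

```shell
# Hypothetical helper: keep column 2 (assumed to be the lemma) of
# tab-separated input and join the values with spaces on one line.
lemmas_only() {
    cut -f 2 | tr '\n' ' '
}

# Made-up sample in the assumed "form<TAB>lemma<TAB>tag" layout.
printf 'cats\tcat\tNNS\nran\trun\tVBD\n' | lemmas_only
```

If the real output uses a different column layout, only the field number passed to ''cut'' would need to change.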