tagger
no way to compare when less than two revisions
Differences
This shows you the differences between two versions of the page.
| — | tagger [2019-06-21 11:32:06] (current) – created - external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| + | ====== Tagger and lemmatizer HOWTO ====== | ||
| + | ===== Installation ===== | ||
| + | |||
| + | < | ||
| + | > git clone https:// | ||
| + | > cd src/ | ||
| + | > vim Makefile.builtem | ||
| + | - C_FLAGS += -std=c++11 -W -Wall -mtune=generic -msse -msse2 -mfpmath=sse -fvisibility=hidden -U_FORTIFY_SOURCE | ||
| + | + C_FLAGS += -std=c++11 -W -Wall -march=native -fvisibility=hidden -U_FORTIFY_SOURCE | ||
| + | > make | ||
| + | </ | ||
| + | |||
| + | ===== Models ===== | ||
| + | |||
| + | Download, unzip: | ||
| + | |||
| + | Czech: https:// | ||
| + | |||
| + | English: https:// | ||
| + | |||
| + | (download link is at the bottom of the page) | ||
| + | |||
| + | (beware, the models may have a non-free license) | ||
| + | ===== Run tagger ===== | ||
| + | |||
| + | < | ||
| + | | ./ | ||
| + | |||
| + | ===== Run lemmatizer ===== | ||
| + | |||
| + | < | ||
| + | | ./ | ||
| + | czech-morfflex-pdt-131112-pos_only-raw_lemmas.tagger 2>/ | ||
| + | | cut -f 2 | tr " | ||
| + | </ | ||
| + | |||
| + | ===== Problems ===== | ||
| + | |||
| + | Loading big models takes several seconds, but the tagging itself is very fast. The new version contains REST server, so it can be started once and handle multiple requests. | ||
tagger.txt · Last modified: by 127.0.0.1
