I’m trying to create a tagger dict for a new language, following https://dev.languagetool.org/developing-a-tagger-dictionary. The language has high inflection, especially for verbs. One verb can take up to 100+ inflections, as they take different forms for 5 moods, 3 tenses, 3 persons, 2 numbers, 2 voices + some other inflections combined with pronouns.
I’ve devised some tags like:
VB - Verb, IND=Indicative, PS=Present, I - First person, S - Singular
Do I create a separate Tag for each combination like VB_IND_PS_I_S for inflection in Indicative, Present, first-person singular or there’s a way to combine them somehow?
Also, some parts of speech are multi-word entries or compound words, how do I represent them in the dict input file?
Your help is very much appreciated,