It would be great if more info on a word could be stored in a database in LT. I would like to store:
- number of compounding parts
- whether this spelling is the most ‘standard’ one, or has an optional hyphen, optional diacritics, or both
- the more common synonyms
and in fact anything else one might want to flag a word for, even whether its status is right, wrong or unknown.
One of those flags is the postag.
Did any of you ever consider expanding the postag file to accommodate other types of flags on a word, and adjusting the postag code to work only on the postag part of the file?
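To make the idea concrete, here is a rough sketch in Python (not LT code). It assumes a tab-separated text line of the form `form<TAB>lemma<TAB>postag`, followed by any number of optional `key=value` columns. The column layout, the flag names (`parts`, `hyphen`, `status`) and the helpers `parse_line` and `postag_of` are all made up for illustration; the real file format may well differ.

```python
from dataclasses import dataclass, field


@dataclass
class WordEntry:
    form: str
    lemma: str
    postag: str
    # Hypothetical extra flags, e.g. {"parts": "2", "hyphen": "optional"}
    flags: dict = field(default_factory=dict)


def parse_line(line):
    """Parse one line of the assumed extended format into a WordEntry."""
    cols = line.rstrip("\n").split("\t")
    form, lemma, postag = cols[:3]
    flags = {}
    for col in cols[3:]:
        # Each extra column is assumed to look like key=value.
        key, _, value = col.partition("=")
        flags[key] = value
    return WordEntry(form, lemma, postag, flags)


def postag_of(line):
    """What the postag code would do: read the third column, ignore the rest."""
    return line.rstrip("\n").split("\t", 3)[2]


line = "e-mail\te-mail\tNOUN\tparts=2\thyphen=optional\tstatus=right\n"
entry = parse_line(line)
print(entry.postag, entry.flags)
print(postag_of(line))
```

The point of the sketch is that the postag lookup never looks past the third column, so extra flags could be appended without affecting it, and lines without any flags would still parse as before.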