Should LT have a postag for acronyms? (I will add ANOVA to spelling; the closest postag is NN:U.)
That might be a good idea - at least I wouldn’t mind.
Another -probably- good extension: a postag for punctuation marks (,;.:-_?!).
In German, there is the postag “PKT” for these non-word characters and I have used it quite often to write rules.
I agree.
I like that idea.
New postag PCT available since yesterday
@Knorr, great, thank you.
Should tagset.txt contain a reference to the new postag? I thought about adding a note, but I am not sure because the PCT postag is in disambiguation, unlike all the other postags.
Well, I have copied the solution from the German disambiguation.xml. The German tagset.txt has not been modified after this extension. So, I think it’s not necessary to change tagset.txt.