@dnaber, thank you.
Initial tests show that the file/rule does exactly what I want.
Now I have a related question for you and the team about disambiguation:
My test term is The Fat Cat.
The multiword chunker removes DT from The and all the readings except NNP from Cat. I guess it also removes readings from Fat, but I cannot see that.
JJ to Fat. For test purposes, I thought of putting an antipattern on the rule to prevent it from changing a multi-word proper noun. However, the rule has this example:
<example type="ambiguous" inputform="Canadian[Canadian/JJ,Canadian/NNP]" outputform="Canadian[Canadian/JJ]">The <marker>Canadian</marker> Badlands is nice.</example>
As it happens, Badlands is NNP, so an exception for a sequence of two proper nouns would mean that, in this context, Canadian is not disambiguated as JJ. I think that it should not be JJ; rather, the correct disambiguation is to make Canadian Badlands a proper noun. (Rule
NNP to Badlands.)
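To make the antipattern idea concrete, here is a rough sketch of what such an exception might look like in disambiguation.xml. The rule id, name, and pattern are placeholders made up for illustration and are not the real rule; only the antipattern part reflects what I described above.

```xml
<!-- Hypothetical sketch: id, name, and pattern are placeholders, not the real rule -->
<rule id="EXAMPLE_JJ_FILTER" name="placeholder for the rule discussed above">
    <!-- Skip the rule when two consecutive tokens both have an NNP reading -->
    <antipattern>
        <token postag="NNP"/>
        <token postag="NNP"/>
    </antipattern>
    <pattern>
        <marker>
            <token postag="JJ"/>
        </marker>
        <token postag="NN.*" postag_regexp="yes"/>
    </pattern>
    <!-- Keep only the JJ reading on the marked token -->
    <disambig action="filter" postag="JJ"/>
</rule>
```

With an antipattern like this, Canadian[JJ,NNP] followed by Badlands[NNP] would be skipped, so the existing example with outputform Canadian[Canadian/JJ] would presumably no longer pass.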
If a term is a multi-word proper noun, should a disambiguator rule that changes only a single token change the
I don’t plan to change any rules at this stage, but I would like comments/suggestions from the team about what the correct analysis is.