DID_BASEFORM rule can be improved

The relevant rule id is DID_BASEFORM.

The part to amend is here:
<token skip="1" regexp="yes" postag="MD|VBD" postag_regexp="yes">did|will|could|can|should|would|does|ll</token>

I think the “ll” at the end is a typo. Anyway…
Suggestion:
<token skip="1" regexp="yes" postag="MD|VBD" postag_regexp="yes">did|will|could|can|should|would|does|shall|cannot|might|may</token>

Added to my task list.

Thanks, I’ve added those words. ll isn’t a typo, it’s used to match the contraction in e.g. we'll.