Back to LanguageTool Homepage - Privacy - Imprint

Using tags


(SafeTex) #1

Last question tonite

I'm looking at rules (already written into LT) and parts of them like

its

(a|the|your|her|his|our|their|mine|ours|theirs|yours)

I'm a bit surprised to see the rule written in this way as you could use tagsets such as

DT Determiner: an, an, all, many, much, any, some, this
JJ Adjective: beautiful, large, inspectable (to indicate a wrong sentence like 'its beautiful)

OK. I can't write rules I know but I'd like to learn. Could the above rule (extract) be improved by using tagsets?

Also, is it an 'error' to include 'a' but not 'an' in the regular expression above?

I'm just trying to read rules for the moment to prepare the way to actually write one myself

Thanks


(Daniel Naber) #2

On Sa 03.11.2012, 14:00:29 you wrote:

OK. I can't write rules I know but I'd like to learn. Could the above
rule (extract) be improved by using tagsets?

It could be written in a more compact way, but whether it's really an
improvement needs to be tested, as this could lead to false alarms. Testing
can be done by the rule creator[1] in expert mode, and by locally checking
against Wikipedia, as described on our Wiki.

Also, is it an 'error' to include 'a' but not 'an' in the regular
expression above?

Indeed, "an" seems to be missing from the list.

[1] http://www.languagetool.org/ruleeditor/

Regards
Daniel

--
http://www.danielnaber.de