Back to LanguageTool Homepage - Privacy - Imprint

XML tags, rules, wild cards and other miscellanea


(Irvine) #1

Okay, I have a few questions relevant to people trying to develop large, versatile rule sets:

1) I have noticed a few XML rule tags not available in the web based editor, is there a complete list of these tags anywhere? e.g. , and

2) Are there wild cards that can be used to form rules, for example:
any word
,

If so, this would not only be useful, but extremely powerful when used in conjunction with

3) I have a few test documents, (pieces of fiction I have written,) but is there a way of testing complex rules against Wikipedia?

4) The web based rule editor allows developers to analyse sentences and phrases, I don't suppose there's a stand-alone version of this? It would be really useful to be able to input text files for analysis, each line containing a sentence or phrase. With a couple of dozen lines of example text, the output could even be loaded into a spreadsheet. A workable format would be to have the parts of speech separated by spaces forming rows and the analysis of each sentence on newlines, forming the columns.


(Daniel Naber) #2

1) I have noticed a few XML rule tags not available in the web based
editor, is there a complete list of these tags anywhere? e.g.
, and

A lot of it is documented at
http://wiki.languagetool.org/development-overview. If you load the XML
file into an XML capable editor, it will also offer you completions for
element and attribute names.

2) Are there wild cards that can be used to form rules, for example:
any word

Use simply "" for any token, maybe in combination with the min
and max attributes.

3) I have a few test documents, (pieces of fiction I have written,)
but is there a way of testing complex rules against Wikipedia?

The online editor already runs against a subset of Wikipedia. How to run
against a local subset (or all of it, if you have plenty of time) is
documented at http://wiki.languagetool.org/developing-robust-rules#toc3

4) The web based rule editor allows developers to analyse sentences
and phrases, I don't suppose there's a stand-alone version of this? It
would be really useful to be able to input text files for analysis,

There's no stand-alone version. You can add any number of example
sentences in the first step of the rule editor. The editor needs to know
if the sentence is correct or not, that's why there's not simply a text
area where you can paste text.

Regards
Daniel