I would be happy to contribute to the English and French grammar checkers, first by populating a database (a Unicode CSV file, for instance) of erroneous and corrected sentences, as in http://wiki.languagetool.org/english-error-collection[/quote]
Excellent! I welcome help on the French version of LanguageTool, especially since I have less time to contribute now.
[quote=“artofit”]I just discovered both LT and Grammalecte (http://www.dicollecte.org/grammalecte/), and while I understand that developers have different views on how to implement things (just as with gcc and clang, for instance), there is still one common objective: checking the grammatical rules of a given language.
So is there a common database or repository for both projects?[/quote]
LT has tests but Grammalecte does not yet, as explained here http://www.dicollecte.org/thread.php?prj=fr&t=387.
LT and Grammalecte are different, but there is healthy cooperation between them. LT uses the dictionary from Dicollecte (transformed for LT), and both checkers have borrowed some rule ideas from each other.
I think it would also be very useful for regression testing to measure how the numbers of false positives, true positives and unmatched errors change between versions.[/quote]
That can certainly be useful.
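A minimal sketch of how such a comparison could be tallied, in Python. It assumes you already have a list of sentences labelled as containing an error or not, plus the set of sentences each checker version flagged. The function names are made up for illustration, and "unmatched" is read here as "an incorrect sentence the checker did not flag", which is only my interpretation.

```python
from collections import Counter


def classify(has_error, flagged):
    """Label one sentence given the expected and the observed outcome."""
    if has_error and flagged:
        return "true_positive"
    if flagged:
        return "false_positive"
    if has_error:
        return "unmatched"  # assumption: an error the checker missed
    return "true_negative"


def tally(corpus, flagged_sentences):
    """Count outcomes for one checker version.

    corpus: list of (sentence, has_error) pairs.
    flagged_sentences: set of sentences that version reported an error in.
    """
    return Counter(
        classify(has_error, sentence in flagged_sentences)
        for sentence, has_error in corpus
    )


def diff_versions(corpus, flagged_old, flagged_new):
    """Return new-minus-old counts for the three outcomes of interest."""
    old = tally(corpus, flagged_old)
    new = tally(corpus, flagged_new)
    keys = ("true_positive", "false_positive", "unmatched")
    return {key: new[key] - old[key] for key in keys}
```

A positive "true_positive" delta together with negative "false_positive" and "unmatched" deltas would indicate an improvement between the two versions. Note that this only looks at whether a sentence was flagged at all, not at which rule fired.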
[quote=“artofit”]3/ “you can do already is to extract the incorrect sentences”
I’m ready to give it a go. I suppose you do not have a parser? If not, I can try with Perl’s regexp[/quote]
A simple grep in the French grammar.xml will give you all the examples:
$ grep '<example' ./languagetool-language-modules/fr/src/main/resources/org/languagetool/rules/fr/grammar.xml
Il était sensé l’accompagner.
Il était censé l’accompagner.
Jimmy Hendrix est né à Seattle.
Jimi Hendrix est né à Seattle.
However, be aware that each example is meant for a single rule only. In fact, some “correct” examples may trigger errors in other rules. That’s not desirable, but it can happen. For instance, I see this example to test rule DESSINER_UN_DESSIN:
<example type="incorrect"><marker>dessiner un dessin</marker></example>
<example type="correct"><marker>faire un dessin</marker></example>
This example is good enough to test rule DESSINER_UN_DESSIN. However, it would trigger an error if we used it to test all other rules (missing uppercase at the beginning of the sentence). Ideally, examples should not trigger errors in any rule other than the one they are meant to test, but that is unfortunately not the case for many old rules, such as the one above. It would be nice to improve those examples so that they are correct when checking all rules.
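If you want more than grep, here is a rough Python sketch that pulls the examples out with regular expressions and writes them as CSV. It assumes each example carries a type attribute as in the snippet above (an example without one gets an empty type), that examples are not nested, and it leaves XML entities untouched. The names extract_examples and write_csv are made up for illustration; this is not part of LanguageTool.

```python
import csv
import re

# One <example ...>...</example> element; assumes examples are never nested.
EXAMPLE_RE = re.compile(r"<example\b([^>]*)>(.*?)</example>", re.DOTALL)
TYPE_RE = re.compile(r'\btype="([^"]*)"')
TAG_RE = re.compile(r"<[^>]+>")


def extract_examples(xml_text):
    """Return (type, sentence) pairs for every <example> element found."""
    rows = []
    for attrs, body in EXAMPLE_RE.findall(xml_text):
        type_match = TYPE_RE.search(attrs)
        example_type = type_match.group(1) if type_match else ""
        # Drop inline tags such as <marker> and collapse whitespace.
        sentence = " ".join(TAG_RE.sub("", body).split())
        rows.append((example_type, sentence))
    return rows


def write_csv(rows, out):
    """Write the pairs to an open text stream as CSV with a header row."""
    writer = csv.writer(out)
    writer.writerow(["type", "sentence"])
    writer.writerows(rows)
```

To use it, read grammar.xml as UTF-8 text, pass the text to extract_examples, and hand the result to write_csv together with sys.stdout or an open output file. Regular expressions are used rather than an XML parser only to keep the sketch short and independent of how the file declares its entities.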
4/ How do you want to exchange/share info & files? On the forum?[/quote]
Here, or on the mailing list. I personally prefer the mailing list, but feel free to use whichever you prefer.