
Spellchecker improvement discussion


(Oleg) #41

@dnaber, could you please run the updated feature extractor?
Could you also run SELECT DISTINCT rule_id FROM corrections WHERE rule_id LIKE 'MORFOLOGIK_%'; and post the result?


(Daniel Naber) #42
MORFOLOGIK_RULE_PL_PL
MORFOLOGIK_RULE_RU_RU
MORFOLOGIK_RULE_CA_ES
MORFOLOGIK_RULE_EN_US
MORFOLOGIK_RULE_UK_UA
MORFOLOGIK_RULE_IT_IT
MORFOLOGIK_RULE_ES
MORFOLOGIK_RULE_EN_GB
MORFOLOGIK_RULE_RO_RO
MORFOLOGIK_RULE_SL_SI
MORFOLOGIK_RULE_EN_AU
MORFOLOGIK_RULE_NL_NL
MORFOLOGIK_RULE_SK_SK
MORFOLOGIK_RULE_AST
MORFOLOGIK_RULE_EL_GR
MORFOLOGIK_RULE_EN_NZ
MORFOLOGIK_RULE_TL
MORFOLOGIK_RULE_BE_BY
MORFOLOGIK_RULE_EN_CA
MORFOLOGIK_RULE_BR_FR
MORFOLOGIK_RULE_EN_ZA
MORFOLOGIK_RULE_SR_EKAVIAN

I will send the result of the feature extractor soon.


(Oleg) #43

The feature extractor contained a mistake: it worked mostly with records for en-based languages. Could you please run the updated feature extractor?


(Daniel Naber) #44

Done, but now the result is rather small (3MB).


(Oleg) #45

I’ve improved the error handling, so could you please run the updated feature extractor one more time?


(Oleg) #46

So there are Morfologik rules that were never logged (i.e. never invoked)? For example “MORFOLOGIK_RULE_DE_DE”.


(Daniel Naber) #47

Sorry, I forgot about our rule ids being inconsistent. German rules are: AUSTRIAN_GERMAN_SPELLER_RULE, GERMAN_SPELLER_RULE, SWISS_GERMAN_SPELLER_RULE
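Given the inconsistent IDs, the earlier query could be widened to catch both naming schemes. The following is only a sketch: it uses an in-memory SQLite database with made-up sample rows as a stand-in for the real database, and the function name distinct_speller_rule_ids is hypothetical. Only the table name corrections and the column rule_id come from this thread. The suffix pattern covers the three German IDs listed above and may miss other languages.

```python
import sqlite3

# "_" is a single-character wildcard in LIKE, so it is escaped with "!"
# to match a literal underscore.
QUERY = """
SELECT DISTINCT rule_id
FROM corrections
WHERE rule_id LIKE 'MORFOLOGIK!_%' ESCAPE '!'
   OR rule_id LIKE '%SPELLER!_RULE' ESCAPE '!'
ORDER BY rule_id
"""


def distinct_speller_rule_ids(conn):
    """Return the distinct rule IDs matching either naming scheme."""
    return [row[0] for row in conn.execute(QUERY)]


# Stand-in data (assumption): the real table has more columns and rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE corrections (rule_id TEXT)")
sample_rule_ids = [
    "MORFOLOGIK_RULE_EN_US",
    "MORFOLOGIK_RULE_EN_US",         # duplicate, removed by DISTINCT
    "GERMAN_SPELLER_RULE",
    "AUSTRIAN_GERMAN_SPELLER_RULE",
    "SWISS_GERMAN_SPELLER_RULE",
    "UPPERCASE_SENTENCE_START",      # not a spelling rule, filtered out
]
conn.executemany(
    "INSERT INTO corrections (rule_id) VALUES (?)",
    [(rule_id,) for rule_id in sample_rule_ids],
)

for rule_id in distinct_speller_rule_ids(conn):
    print(rule_id)
```

Any speller rule whose ID follows neither pattern would still be missed, so the list of patterns should be checked against the actual rule classes.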


(Oleg) #48

OK, and are there any other languages whose Morfologik rules have IDs that don't follow the MORFOLOGIK_ naming pattern?


(Daniel Naber) #49

Not sure. Please check the getId() of all classes extending SpellingCheckRule.
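One rough way to do that check without opening every class by hand is to scan a source checkout with regular expressions. This is a heuristic sketch, not a parser: the function name find_speller_rule_ids is hypothetical, and it assumes each .java file declares one top-level class and that the ID is either returned as a string literal from getId() or stored in a RULE_ID string constant. Classes that build the ID some other way show up with None and need a manual look.

```python
import re
from pathlib import Path

# First "class X extends Y" declaration in a file (heuristic).
CLASS_RE = re.compile(r'\bclass\s+(\w+)(?:<[^>]*>)?\s+extends\s+(\w+)')
# Assumption: getId() returns a string literal directly ...
LITERAL_ID_RE = re.compile(r'getId\s*\(\s*\)\s*\{\s*return\s+"([^"]+)"\s*;')
# ... or the ID is kept in a RULE_ID string constant.
CONST_ID_RE = re.compile(r'\bRULE_ID\s*=\s*"([^"]+)"')


def find_speller_rule_ids(source_root, base="SpellingCheckRule"):
    """Map each class that transitively extends `base` to the rule ID
    found in its source file, or None if no ID could be extracted."""
    parents = {}  # class name -> direct superclass name
    ids = {}      # class name -> rule ID found in the same file
    for path in Path(source_root).rglob("*.java"):
        text = path.read_text(encoding="utf-8", errors="replace")
        class_match = CLASS_RE.search(text)
        if not class_match:
            continue
        name, parent = class_match.groups()
        parents[name] = parent
        id_match = LITERAL_ID_RE.search(text) or CONST_ID_RE.search(text)
        if id_match:
            ids[name] = id_match.group(1)

    def descends_from_base(name):
        # Walk up the superclass chain; `seen` guards against cycles.
        seen = set()
        while name in parents and name not in seen:
            seen.add(name)
            name = parents[name]
            if name == base:
                return True
        return False

    return {name: ids.get(name) for name in parents if descends_from_base(name)}
```

Calling find_speller_rule_ids with the path to a local checkout returns a dictionary of class names and IDs. Filtering it for IDs that do not start with MORFOLOGIK_ gives the irregularly named candidates.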


(Oleg) #50

Ok, thanks!