[GSoC reports] spellchecker, server-side framework and build tool tasks

oserikov · August 14, 2018, 11:59am

GSoC 2018 Work Summary

What was done

During this Summer of Code I worked on several tasks.
First, I improved the sorting of spellchecker suggestions using a machine learning approach. This included the following submissions to the languagetool repository on GitHub:

#1020 (merged)
#1115 (merged)

Code for the model learning part is in this repo.
Suggestions are now ordered by the predictions of the trained model (xgboost was used), which improved the quality of the resulting ordering.
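
To illustrate the idea of ordering suggestions by a model score, here is a minimal Python sketch. It is not the code from the PRs above: the feature names, the `stand_in_score` function and `rank_suggestions` are hypothetical, and the hand-written scorer only stands in for the trained xgboost model so that the example runs without extra dependencies.

```python
from typing import Callable, Dict, List


def features(misspelling: str, candidate: str) -> Dict[str, float]:
    """Build a small, hypothetical feature set for one (typo, candidate) pair."""
    return {
        "length_diff": float(abs(len(misspelling) - len(candidate))),
        "same_first_letter": float(misspelling[:1] == candidate[:1]),
        "shared_chars": float(len(set(misspelling) & set(candidate))),
    }


def stand_in_score(f: Dict[str, float]) -> float:
    """Hand-written stand-in for a trained model's prediction (assumption)."""
    return f["shared_chars"] + 2.0 * f["same_first_letter"] - f["length_diff"]


def rank_suggestions(
    misspelling: str,
    candidates: List[str],
    score: Callable[[Dict[str, float]], float] = stand_in_score,
) -> List[str]:
    """Return candidates sorted from highest to lowest score."""
    return sorted(
        candidates,
        key=lambda c: score(features(misspelling, c)),
        reverse=True,
    )


if __name__ == "__main__":
    print(rank_suggestions("improvment", ["movement", "improvement", "impairment"]))
```

In a real setup the `score` argument would wrap the trained model's prediction call, and the features would be whatever the model was trained on.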

Second, switching to a modern server-side framework (Spring):

#1046 (open)

Third, migration from Maven to Gradle:

#1045 (open)

Other submissions:

#940 (merged) - fix for issue #373
#920 (merged) - fix for issue #652

Future work

I’m willing to continue contributing to languagetool outside GSoC. In particular, I plan to do the following within my project:

further improve the ML model quality (parameter tuning, feature engineering, adding new features)
finish transition to Gradle
finish transition to Spring
address all suggested corrections and get open PRs merged.