I’m interested in expanding the N-gram rules for English.
I have downloaded the N-gram data set and set up ConfusionRuleEvaluator in my IDE. The next step is to supply its arguments.
I am fine with the first two arguments, but the third one requires a list of correct sentences. The documentation states that a good start is a combination of Tatoeba and Wikipedia sentences for English. Where can I download this file of example sentences, or how can I generate it myself?