I would love to use a deep learning based approach for spelling error detection.
I intend to use a char-level model and the dataset i intend to use is the billion word dataset.
some of the examples of error that can be detected are :
original : he had dated forI much of the past
corrected : he had dated for much of the past
original : Since then, the bigjest players in
corrected : Since then, the biggest players in
original : in te third quarter of last year,
corrected : in the third quarter of last year,