Where can I find the stand alone version of the routine that segments our texts into sentences using our srx file?
I need that to split raw text input (mainly paragraphs) into sentences, just like LT does.
For 2 reasons:
1 Checking if it needs improvement
2 To align the corpus into LT-ready sentences
I manged to find a version: https://github.com/loomchild/segment/releases