Need help with ngrams

Hello world :wink:

I’ve downloaded LT

git clone --depth 5 GitHub - languagetool-org/languagetool: Style and Grammar Checker for 25+ Languages

and then I’ve compiled the dev version, in order to have access to ngram feats

cd languagetool/languagetool-dev; mvn clean package -DskipTests

I have made a csv file with bigrams for a language and I want to make use of it in LT, as per

https://www.google.com/search?q=org.languagetool.dev.frequencyindexcreator+site%3Aforum.languagetool.org

But I get errors

java -cp ./languagetool/languagetool-dev/target/languagetool-dev-6.4-SNAPSHOT.jar org.languagetool.dev.bigdata.FrequencyIndexCreator lucene ./input/2 ./output/2grams

but still not there :frowning:

Mode: Lucene
Minimum year: 1910
Ignore POS tags: true
Total input bytes: 4096
Exception in thread “main” java.lang.NoClassDefFoundError: org/apache/lucene/analysis/Analyzer
at org.languagetool.dev.bigdata.FrequencyIndexCreator.run(FrequencyIndexCreator.java:88)
at org.languagetool.dev.bigdata.FrequencyIndexCreator.main(FrequencyIndexCreator.java:375)
Caused by: java.lang.ClassNotFoundException: org.apache.lucene.analysis.Analyzer
at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:581)
at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178)
at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:527)
… 2 more

Can someone help me please?

@LanguageTool