Hi all, so we have downloaded the LT packages and have setup a local environment for the same. As we are primarily copyeditors and want the system to aid us with some of the common tasks of copyediting, we are going through all the grammar rules one by one.
Now the LT is a very comprehensive system, and has almost 1500+ rules in the grammar. xml, so we wanted to have a limited sample space for testing, also we would like to tailor it a bit for copyeditors. Our efforts at understanding the system have raised some questions and it would be very helpful to get some clarity on these issues. The questions are as follows:
Is it possible to remove almost all the rules from grammar.xml and just retain a select few?
Is there any documentation for various rules that are not a part of grammar.xml but are part of the main .jar files? If yes can we access them?
Is there any documentation for non-technical users (with some base working knowledge of coding and Xmls) for editing the grammar.xml?
Our observations were that there are many rules in the grammar.xml, which are very much instance specific. And we are attempting to generate rules which could be more comprehensive using POS tags and Chunker tags.
However, is there any documentation apart from the wiki pages which describe the function of the various tags in the grammar.xml?