This is the very first post on the forum and I am very new on this platform. Please help me to understand the algorithm to set tag for each word and phrase level. I have been debugging LT code which I found the class EnglishChunker have PosTags for each token and the ChunkTag for phrase level tagging.
During functional spike testing, I have submitted a sentence “A new ROS, reactive oxygen species found.” into analyzeText which I found no posTag for the token ROS. On the other hand, PosTags in java code set the value NNS tag to the token ROS.
Is analyzeText Text Analysis - LanguageTool endpoint referred different from EnglishChunker/analyzeText?
Is there any scope of improvement on tagging? As I understood from several posts in this forum that the tagging is not 100%
How would LT handle abbreviations? Please refer the above sentence, ROS is an abbreviation.
As I am very much curious to know more about articles usage for abbreviations, please do share details with me.
Thanking you in advance.