uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Klügl (JIRA) <...@uima.apache.org>
Subject [jira] [Commented] (UIMA-4079) MarkTable action not able to recognize entities with two or more words
Date Sun, 02 Nov 2014 10:34:33 GMT

    [ https://issues.apache.org/jira/browse/UIMA-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14193774#comment-14193774
] 

Peter Klügl commented on UIMA-4079:
-----------------------------------

Yes, this would be the first point in my last comment. It will solve the problem for most
use cases but will not provide a general solution. It should be could enough for now. 

A new configuration parameter in the analysis engine will require the users to regenerate
their descriptors, so we should probably move to 2.3.0 instead of 2.2.2 for the next release.

> MarkTable action not able to recognize entities with two or more words
> ----------------------------------------------------------------------
>
>                 Key: UIMA-4079
>                 URL: https://issues.apache.org/jira/browse/UIMA-4079
>             Project: UIMA
>          Issue Type: Bug
>          Components: ruta
>    Affects Versions: 2.2.2ruta
>            Reporter: Silvestre Losada
>             Fix For: 2.2.2ruta
>
>
> I think this error was introduced solving UIMA-4071. The problem is that  RutaStream.getVisibleCoveredText
method removes whitespaces in covered text. For example Bill Clinton covered text returns
BillClinton.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message