uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Burn Lewis (JIRA)" <uima-...@incubator.apache.org>
Subject [jira] Commented: (UIMA-1041) UIMACPP Pythonator issues with annotation offsets and lengths - off by 1 errors
Date Thu, 29 May 2008 13:20:45 GMT

    [ https://issues.apache.org/jira/browse/UIMA-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12600766#action_12600766

Burn Lewis commented on UIMA-1041:

FWIW - on Windows the offsets look correct, using a build from src.  Python & Perl &
Tcl XCASes look OK in CVD.

But the tcl example generates duplicate annotations and gets an overflow error ... problem
seems to be that the tcl regexp match sometimes returns pairs such as:
65 69
65 69
-1 -1
I tried the ActiveTcl 8.4 build and also built 8.4 & 8.5

> UIMACPP Pythonator issues with annotation offsets and lengths - off by 1 errors
> -------------------------------------------------------------------------------
>                 Key: UIMA-1041
>                 URL: https://issues.apache.org/jira/browse/UIMA-1041
>             Project: UIMA
>          Issue Type: Bug
>          Components: C++ Framework
>         Environment: RedHat, UIMACPP 2.2.2 release candidate 01, uima base 2.2.2
>            Reporter: Marshall Schor
> The sample python script when run in the document analyzer shows annotations where the
highlight is always missing the last character, and the details show the offsets for the begin
and end to be both one to low.
> To reproduce, run the sample script in the python directory of the scriptators (after
doing a build /install of the pythonator following the directions in the python directory
in python.html).

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message