uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Klügl (JIRA) <...@uima.apache.org>
Subject [jira] [Commented] (UIMA-2452) Complex Types
Date Thu, 02 Aug 2012 09:43:02 GMT

    [ https://issues.apache.org/jira/browse/UIMA-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427208#comment-13427208
] 

Peter Klügl commented on UIMA-2452:
-----------------------------------

The problem here is that there are different interpretations what a sentence, date and time
is or should look like. All but line are somewhat domain or language dependent. A new seeder
only for a line annotation...

Date and time are yet more complex to define and rather a use case for TextMarker rules that
are used as a library.

I'd rather pursue different approaches:
- adding those annotation to the cas/xmi before TextMarker rules are applied, e.g., by a sentence
splitter analysis engine
- adding an analysis engine, e.g., to the descriptor/utils folder and calling it within a
TextMarker script in order to add those annotation.

You could of course also write our own seeder and configure the TextMarker analysis engine
template in a project so that more annotations are automatically created but that new feature
isn't well tested yet.

Anyways, that is just my opinion and I am open to discussion.

                
> Complex Types
> -------------
>
>                 Key: UIMA-2452
>                 URL: https://issues.apache.org/jira/browse/UIMA-2452
>             Project: UIMA
>          Issue Type: New Feature
>          Components: TextMarker
>            Reporter: Philipp Daumke
>
> Hi Peter,
> I think, some more complex types would really be a useful add-on. Currently I have in
mind:
> SENTENCE
> LINE
> DATE
> TIME
> Best regards
> Philipp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message