uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marshall Schor (JIRA)" <...@uima.apache.org>
Subject [jira] [Commented] (UIMA-5662) uv3 support CAS deserialization subsequent low level access
Date Wed, 03 Jan 2018 13:59:00 GMT

    [ https://issues.apache.org/jira/browse/UIMA-5662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16309678#comment-16309678
] 

Marshall Schor commented on UIMA-5662:
--------------------------------------

I'm trying to support XCAS and Xmi in this new mode, as well.  

For Xmi, the serialized form may contain sequences of UIMA Lists, encoded as just the item
values; this serialization doesn't have any fsId information for these.  (Note: some list
elements may be multiply referenced; these will have fsIds).  For the missing fsId case, I'm
thinking of assigning fsIds to these, following the deserialization.  

XCAS should be OK - all Feature Structures (I believe) have id's in the serialized format.

> uv3 support CAS deserialization subsequent low level access
> -----------------------------------------------------------
>
>                 Key: UIMA-5662
>                 URL: https://issues.apache.org/jira/browse/UIMA-5662
>             Project: UIMA
>          Issue Type: Improvement
>          Components: Core Java Framework
>    Affects Versions: 3.0.0SDK-beta
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>             Fix For: 3.0.0SDK
>
>
> Some users depend 1) constant v2-ids for FSs preserved in deserialization and serialization,
and 2) low level cas API access to these.
> V3 normally doesn't maintain tables linking ids to FSs, as these (unless weak refs are
used) prevent GC of unreachable FSs.
> Based on a mode, set by -Duima.deserialize_perserve_ids, and also controllable by new
config option per deserialize call, alter the deserialization for those deserializers which
know about v2 ids, to put these into the map used for low-level CAS access, using the actual
v2 ids, and change the v3 next available id for future new FSs to be 1 beyond the end.
> The -Duima.deserialize-preserve_ids global setting is needed to handle the use case of
some annotators using low-level APIs, when part of a pipeline is "remoted". 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message