uima-dev mailing list archives

From "Richard Eckart de Castilho (JIRA)" <...@uima.apache.org>
Subject [jira] [Commented] (UIMA-4685) Add Common Load/Save APIs, including type system and index defs, for various CAS formats
Date Wed, 24 Aug 2016 09:40:20 GMT

    [ https://issues.apache.org/jira/browse/UIMA-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434606#comment-15434606 ]

Richard Eckart de Castilho commented on UIMA-4685:

[~schor] how would you suggest implementing this method?

The lowest level of the load methods still accepts a TSI input stream, and internally, as
far as I can see, all methods expect that a full CASMgrSerializer can be obtained from that
stream. You had changed that from proactively loading the CASMgrSerializer to lazily loading it.

Do you see any way of supporting the load(IS, CAS, TS) method without a lot of code duplication
and without switching back to proactively loading the CASMgrSerializer from the TSI stream
for the other load methods? One option I see would be to introduce indirections like "TypeSystemSource"
and maybe "IndexDefinitionSource" which could be either pre-initialized with a TS (or CASMgrSerializer)
or which could load them on demand from a stream.
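
To make the indirection idea concrete, here is a minimal sketch in plain Java. It is not existing UIMA code: the names Source, StreamLoader, preInitialized, lazy and readUtf8 are hypothetical, and a String stands in for the type system (or CASMgrSerializer) so the example runs without UIMA on the classpath. The point is only that one load method can serve both a caller who already holds a TS and a caller who passes a TSI stream that is read on first use.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class TypeSystemSourceSketch {

    /** Hypothetical indirection: yields a value (e.g. a type system) on request. */
    public interface Source<T> {
        T get();
    }

    /** Hypothetical callback that deserializes a value from a stream. */
    public interface StreamLoader<T> {
        T load(InputStream in) throws IOException;
    }

    /** Variant 1: the caller already has the value, so no stream is read. */
    public static <T> Source<T> preInitialized(T value) {
        return () -> value;
    }

    /** Variant 2: the stream is read on the first get() and the result is cached. */
    public static <T> Source<T> lazy(InputStream in, StreamLoader<T> loader) {
        return new Source<T>() {
            private T cached;
            private boolean loaded;

            @Override
            public synchronized T get() {
                if (!loaded) {
                    try {
                        cached = loader.load(in);
                    } catch (IOException e) {
                        throw new UncheckedIOException(e);
                    }
                    loaded = true;
                }
                return cached;
            }
        };
    }

    /** Stand-in for real deserialization: reads the whole stream as UTF-8 text. */
    public static String readUtf8(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    /** A load method written against Source does not care which variant it gets. */
    public static String load(String casData, Source<String> typeSystem) {
        return casData + " using " + typeSystem.get();
    }

    public static void main(String[] args) {
        InputStream tsi =
            new ByteArrayInputStream("TS-from-stream".getBytes(StandardCharsets.UTF_8));
        System.out.println(load("cas", preInitialized("TS-in-memory")));
        System.out.println(load("cas", lazy(tsi, TypeSystemSourceSketch::readUtf8)));
    }
}
```

With something along these lines, load(IS, CAS, TS) would wrap the TS in the pre-initialized variant and the stream-based overloads would wrap the TSI stream in the lazy variant, so the shared lower-level code would stay lazy for the existing callers.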

> Add Common Load/Save APIs, including type system and index defs, for various CAS formats
> ----------------------------------------------------------------------------------------
>                 Key: UIMA-4685
>                 URL: https://issues.apache.org/jira/browse/UIMA-4685
>             Project: UIMA
>          Issue Type: New Feature
>          Components: CasEditor, Core Java Framework
>    Affects Versions: 2.8.1SDK
>            Reporter: Peter Klügl
>            Assignee: Marshall Schor
>             Fix For: 2.9.0SDK
>         Attachments: BinaryCasIOUtils.java, caseditor.csv, caseditor_bincas.csv
> Create a new CasIOUtils class having common load and save APIs, with optional saving
> of type system and index definitions, in all the various CAS serialization formats (except
> JSON).  Implement heuristic discovery for deserialization to support automatic selection of
> the right kind of deserializer.  Update the headers for binary to support indicating they
> have the type system included (not used yet).  Update the SerialFormat to support this.  Update
> the CasEditor and other components to use this new API.
