uima-dev mailing list archives

From Jörn Kottmann (JIRA) <...@uima.apache.org>
Subject [jira] Updated: (UIMA-1782) Encoding of text files during import should be configurable
Date Mon, 17 May 2010 11:10:44 GMT

     [ https://issues.apache.org/jira/browse/UIMA-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jörn Kottmann updated UIMA-1782:

         Assignee: Jörn Kottmann
    Fix Version/s: 2.3.1

> Encoding of text files during import should be configurable
> -----------------------------------------------------------
>                 Key: UIMA-1782
>                 URL: https://issues.apache.org/jira/browse/UIMA-1782
>             Project: UIMA
>          Issue Type: Improvement
>          Components: CasEditor
>    Affects Versions: 2.3
>            Reporter: Thomas Hampp
>            Assignee: Jörn Kottmann
>             Fix For: 2.3.1
> During import of text files into a corpus it seems to be impossible to control the encoding
> used. It looks like the default platform encoding is used (Latin-1 on Western Windows systems).
> The Eclipse default encoding settings for text files don't seem to affect the import encoding.
> That makes it impossible to import documents with international characters in UTF-8.
> Ideally the encoding should be selectable in a drop down field in the import wizard.
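
To illustrate the problem described above, here is a minimal sketch of reading a text file with a caller-supplied encoding instead of the platform default. It is not the CasEditor import code: the class `ImportEncodingSketch` and the method `readDocument` are hypothetical names chosen for this example, and it assumes a Java version that provides `java.nio.file.Files`. The `main` method writes a UTF-8 file and then decodes it twice, once correctly and once as Latin-1, to show how the wrong encoding garbles non-ASCII characters.

```java
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

final class ImportEncodingSketch {

    // Hypothetical helper (not part of UIMA): decode the file's bytes with
    // the encoding chosen by the caller rather than the platform default.
    public static String readDocument(Path file, Charset encoding) throws IOException {
        return new String(Files.readAllBytes(file), encoding);
    }

    public static void main(String[] args) throws IOException {
        // Write a sample document containing a non-ASCII character as UTF-8.
        Path file = Files.createTempFile("import-sample", ".txt");
        Files.write(file, "J\u00f6rn".getBytes(StandardCharsets.UTF_8));

        // Decoding with the matching encoding restores the original text.
        String correct = readDocument(file, StandardCharsets.UTF_8);

        // Decoding the same bytes as Latin-1 turns the two-byte UTF-8
        // sequence into two separate characters.
        String garbled = readDocument(file, StandardCharsets.ISO_8859_1);

        System.out.println(correct.length()); // 4 characters
        System.out.println(garbled.length()); // 5 characters

        Files.delete(file);
    }
}
```

An import wizard could pass the value selected in an encoding drop-down as the `encoding` argument, so the result would no longer depend on the platform default.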

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
