tika-dev mailing list archives

From "Mane (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1121) Socket server text parsing error on large text files
Date Wed, 20 Nov 2013 15:09:36 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13827739#comment-13827739 ]

Mane commented on TIKA-1121:
----------------------------

Is there any update on this? I am trying to parse large HTML/PDF files, and the Tika socket
server stops responding. I also tried the Tika JAX-RS network server, which returns a
500 Server Error. Could someone please look into this?

> Socket server text parsing error on large text files
> ----------------------------------------------------
>
>                 Key: TIKA-1121
>                 URL: https://issues.apache.org/jira/browse/TIKA-1121
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.4
>         Environment: Ubuntu 10.04, 10.10, 12.04.02
>            Reporter: Dave Meikle
>            Assignee: Dave Meikle
>
> As reported on the user list[1], when using the tika-app socket server command with the
> -t switch to parse text, the process hangs on large text files.
> This occurs on Ubuntu 10.04, 10.10 and 12.04.02.
> [1]http://mail-archives.apache.org/mod_mbox/tika-user/201305.mbox/%3CCAGxBzUFxSJ4h5jWdeUX9HhD2FxtTQ1vsbM7u-VfSyGE9VmrQHQ@mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.1#6144)
