tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ewan Mellor (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-2613) Tesseract 4.0 has removed -psm, so Tika must update
Date Mon, 26 Mar 2018 23:24:00 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ewan Mellor updated TIKA-2613:
------------------------------
    Description: 
Tesseract 4.0 (currently in beta-1) has removed the {{\-psm}} flag, in favor of {{\-\-psm}}
(with two dashes).

The {{\-\-psm}} variant was introduced in Nov 2016, so it should be safe to simply switch
Tika to use the two-dash variant, even for people still using Tesseract 3.05.

For reference, the Tesseract cset is:

{code}
commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
Author: Stefan Weil <sw@weilnetz.de>
Date: Sun Mar 25 17:28:33 2018 +0200

Remove deprecated support for -psm argument (#1419)

It was replaced by --psm and deprecated in commit 92d981b93.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
{code}

  was:
Tesseract 4.0 (currently in beta-1) has removed the {-psm} flag, in favor of {--psm} (with
two dashes).

The {--psm} variant was introduced in Nov 2016, so it should be safe to simply switch Tika
to use the two-dash variant, even for people still using Tesseract 3.05.

For reference, the Tesseract cset is:

{code}
commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
Author: Stefan Weil <sw@weilnetz.de>
Date: Sun Mar 25 17:28:33 2018 +0200

Remove deprecated support for -psm argument (#1419)

It was replaced by --psm and deprecated in commit 92d981b93.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
{code}


> Tesseract 4.0 has removed -psm, so Tika must update
> ---------------------------------------------------
>
>                 Key: TIKA-2613
>                 URL: https://issues.apache.org/jira/browse/TIKA-2613
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.17
>            Reporter: Ewan Mellor
>            Priority: Major
>
> Tesseract 4.0 (currently in beta-1) has removed the {{\-psm}} flag, in favor of {{\-\-psm}}
(with two dashes).
> The {{\-\-psm}} variant was introduced in Nov 2016, so it should be safe to simply switch
Tika to use the two-dash variant, even for people still using Tesseract 3.05.
> For reference, the Tesseract cset is:
> {code}
> commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
> Author: Stefan Weil <sw@weilnetz.de>
> Date: Sun Mar 25 17:28:33 2018 +0200
> Remove deprecated support for -psm argument (#1419)
> It was replaced by --psm and deprecated in commit 92d981b93.
> Signed-off-by: Stefan Weil <sw@weilnetz.de>
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message