tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-605) Tika GDAL parser
Date Sat, 11 Oct 2014 15:20:33 GMT

     [ https://issues.apache.org/jira/browse/TIKA-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Chris A. Mattmann updated TIKA-605:
    Attachment: TIKA-605.Mattmann.101114.patch.2.txt

- this patch fully works but I had to drop direct support for the ExternalParser (see the
method comments) and bring in some of that functionality directly into this class. This is
due to the ExternalParser not really handling the case where I need to get Metadata *and*
text output from the external command output, and I need the metadata *first* before I call
the handler.

- added in a test for a FITS file as well.

- will be adding docs on the wiki for this soon. Hope to get this committed in the next few

FITS file located here: http://fits.gsfc.nasa.gov/samples/WFPC2u5780205r_c0fx.fits

> Tika GDAL parser
> ----------------
>                 Key: TIKA-605
>                 URL: https://issues.apache.org/jira/browse/TIKA-605
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>         Environment: indep. of env.
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>              Labels: gdal, gsoc2013, integration, mentor, tika
>             Fix For: 1.7
>         Attachments: 0001-TIKA-605-Tika-GDAL-parser.patch, TIKA-605.Mattmann.092511.patch.txt,
TIKA-605.Mattmann.100914.1.patch.txt, TIKA-605.Mattmann.100914.2.patch.txt, TIKA-605.Mattmann.101114.patch.2.txt
> Leverage the GDAL toolkit and its Java SWIG bindings to create a Tika parser around GDAL.
See here: http://trac.osgeo.org/gdal/browser/trunk/gdal/swig/java/apps/gdalinfo.java

This message was sent by Atlassian JIRA

View raw message