tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arturo Beltran (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-443) Geographic Information Parser
Date Tue, 22 Jun 2010 09:55:05 GMT

    [ https://issues.apache.org/jira/browse/TIKA-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12881146#action_12881146

Arturo Beltran commented on TIKA-443:

I'm not convinced about using OGDI. From what I understand from reading the documentation,
OGDI offers an API in C, so we encounter the same problem to integrate it with Java. In addition,
the project is not updated since 2008, so new geographic formats are not supported (i.e: KML).
Also, I think OGDI does not support databases or services.

However, you can do some proof of concept to see if it would be very difficult to integrate
with Java and see exactly what metadata can be extracted using OGDI. Then we can compare these
results with mine and decide. 

As you can see, I've attached a sample XML file (getFDOMetadata.xml) that contains the information
extracted of a SHP by my proof of concept server based on FDO. This is the result after a
simple HTTP call (http://localhost:12345/getFDOMetadata?source=C:\ExampleData\shp_world_countries\country.shp&provider=SHP)

For now, I'll keep trying to run muy "Hello world" Tika parser.


> Geographic Information Parser
> -----------------------------
>                 Key: TIKA-443
>                 URL: https://issues.apache.org/jira/browse/TIKA-443
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Arturo Beltran
>         Attachments: getFDOMetadata.xml
> I'm working in the automatic description of geospatial resources, and I think that might
be interesting to incorporate new parser/s to Tika in order to manage and describe some geo-formats.
These geo-formats include files, services and databases.
> If anyone is interested in this issue or want to collaborate do not hesitate to contact
me. Any help is welcome.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message