nutch-dev mailing list archives

From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-2079) Tika Parsing plugin issue
Date Sat, 29 Aug 2015 02:08:46 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-2079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney updated NUTCH-2079:
----------------------------------------
    Fix Version/s:     (was: 2.3)
                   2.4.1

> Tika Parsing plugin issue
> -------------------------
>
>                 Key: NUTCH-2079
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2079
>             Project: Nutch
>          Issue Type: New Feature
>          Components: deployment
>    Affects Versions: 2.3
>         Environment: Ubuntu 14.04
>            Reporter: Pradumna Panditrao
>             Fix For: 2.4
>
>
> Hi,
> I am trying to parse particular data and store it in MongoDB. However, when I
> modify the parse-tika plugin, it is so tightly coupled with other classes that
> some of the data is lost. I want to extract particular data from a website
> using the same plugin and store it in MongoDB.
> Please suggest how to do this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
