lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From eShard <>
Subject how to get solr (tika?) to capture more metadata from RSS feed?
Date Fri, 01 Mar 2013 15:35:34 GMT
I have a lot of non standard IBM RSS feeds that needs to be crawled (via
ManifoldCF v1.1.1) and put into solr 4.0 final.
The problem is that we need to put the additional non standard metadata into
I've confirmed via fiddler that manifoldcf is indeed sending all the
appropriate metadata but something in solr is removing all of it. It's
either tika, rome or something else in solr.
see this link for more details  tika post

So, is there a way to configure tika (or rome which handles RSS parsing) to
capture the additional metadata?
I read that the tika config file is deprecated or obsolete. Is that true?


View this message in context:
Sent from the Solr - User mailing list archive at

View raw message