manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: Text extraction On ManifoldCF
Date Mon, 07 Mar 2011 11:16:30 GMT
I think you probably want to post this question to
java-user@lucene.apache.org.  That's the new "solr-user" list.

Thanks, and let us know what the issue is, when you find out.

Karl

On Mon, Mar 7, 2011 at 5:20 AM, Shinichiro Abe
<shinichiro.abe.1@gmail.com> wrote:
> Hello.
>
> When Text Extraction on Solr side,  I executed the following to get metadata of xls/doc/pdf
files in Solr Example.
> I could get metadata(name,size,etc.) on Solr, but could not get content extracted text
.
> I couldn't find string field like "s_content".
>
> 1. solrconfig.xml
>   In ExtractingRequestHandler
>   <str name="uprefix">s_</str>
> 2. schema.xml
>   <dynamicField name="s_*"  type="text"  indexed="true"  stored="true"/>
> 3. On ManifoldCF, it processed the Job(Windows shares (or filesystem) repository to Solr
output connector).
>
> I guess it have to set up using Solr Cell setting for get content .
> What do I need to do?(Solr user's question?)
>
> Regards,
> Abe
>

Mime
View raw message