manifoldcf-user mailing list archives

From Shinichiro Abe <shinichiro.ab...@gmail.com>
Subject Re: Text extraction On ManifoldCF
Date Mon, 07 Mar 2011 13:35:12 GMT
Thank you for your reply. It worked well.
My mistake. There was something wrong with my fmap settings.
In the Solr example configuration, content is mapped to "text", which is indexed but not stored.
So I couldn't see that text when the document was retrieved.
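
A minimal sketch of the kind of mapping involved, assuming the ExtractingRequestHandler defaults from the Solr example solrconfig.xml. The handler name and class are taken from that example and are assumptions here, not the exact settings used in this thread. The target field s_content is chosen only because it matches the stored s_* dynamic field quoted below.

```xml
<!-- solrconfig.xml (sketch; handler name and class are assumed from the Solr example) -->
<requestHandler name="/update/extract"
                class="org.apache.solr.handler.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- Map the extracted body ("content") to a stored field instead of "text" -->
    <str name="fmap.content">s_content</str>
    <!-- Prefix applied to fields that are not defined in schema.xml -->
    <str name="uprefix">s_</str>
  </lst>
</requestHandler>
```

With a mapping like this, the extracted text should land in a field covered by the stored s_* dynamic field, so it can be returned in search results.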

Shinichiro Abe

On 2011/03/07, at 20:16, Karl Wright wrote:

> I think you probably want to post this question to
> java-user@lucene.apache.org.  That's the new "solr-user" list.
> 
> Thanks, and let us know what the issue is, when you find out.
> 
> Karl
> 
> On Mon, Mar 7, 2011 at 5:20 AM, Shinichiro Abe
> <shinichiro.abe.1@gmail.com> wrote:
>> Hello.
>> 
>> When doing text extraction on the Solr side, I ran the following to get metadata of xls/doc/pdf files in the Solr example.
>> I could get metadata (name, size, etc.) in Solr, but could not get the extracted text content.
>> I couldn't find a string field like "s_content".
>> 
>> 1. solrconfig.xml
>>   In ExtractingRequestHandler
>>   <str name="uprefix">s_</str>
>> 2. schema.xml
>>   <dynamicField name="s_*"  type="text"  indexed="true"  stored="true"/>
>> 3. In ManifoldCF, I ran the job (Windows shares (or filesystem) repository to Solr output connector).
>> 
>> I guess I have to configure Solr Cell to get the content.
>> What do I need to do? (Is this a Solr user's question?)
>> 
>> Regards,
>> Abe
>> 

