lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: extract and add fields on the fly
Date Wed, 28 Jan 2015 18:38:40 GMT
Well, the schema does need to know what type your field is. If you
can't add it to schema, use dynamicFields with prefixe/suffixes or
dynamic schema (less recommended).

Regards,
   Alex.
----
Sign up for my Solr resources newsletter at http://www.solr-start.com/


On 28 January 2015 at 13:32, Mark <javamark@gmail.com> wrote:
> That approach works although as suspected the schma has to recognise the
> additinal facet (stuff in this case):
>
> "responseHeader":{"status":400,"QTime":1},"error":{"msg":"ERROR:
> [doc=000000006252671B765A1748992DF1A6403BDF81A4A15E00] unknown field
> 'stuff'","code":400}}
>
> ..getting closer..
>
> On 28 January 2015 at 18:03, Mark <javamark@gmail.com> wrote:
>
>>
>> Use case is
>>
>> use curl to upload/extract/index document passing in additional facets not
>> present in the document e.g. literal.source="old system"
>>
>> In this way some fields come from the uploaded extracted content and some
>> fields as specified in the curl URL
>>
>> Hope that's clearer?
>>
>> Regards
>>
>> Mark
>>
>>
>> On 28 January 2015 at 17:54, Alexandre Rafalovitch <arafalov@gmail.com>
>> wrote:
>>
>>> Sounds like 'literal.X' syntax from
>>>
>>> https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika
>>>
>>> Can you explain your use case as different from what's already
>>> documented? May be easier to understand.
>>>
>>> Regards,
>>>    Alex.
>>> ----
>>> Sign up for my Solr resources newsletter at http://www.solr-start.com/
>>>
>>>
>>> On 28 January 2015 at 12:45, Mark <javamark@gmail.com> wrote:
>>> > I'm looking to
>>> >
>>> > 1) upload a binary document using curl
>>> > 2) add some additional facets
>>> >
>>> > Specifically my question is can this be achieved in 1 curl operation or
>>> > does it need 2?
>>> >
>>> > On 28 January 2015 at 17:43, Mark <javamark@gmail.com> wrote:
>>> >
>>> >>
>>> >> Second thoughts SID is purely i/p as its name suggests :)
>>> >>
>>> >> I think a better approach would be
>>> >>
>>> >> 1) curl to upload/extract passing docID
>>> >> 2) curl to update additional fields for that docID
>>> >>
>>> >>
>>> >>
>>> >> On 28 January 2015 at 17:30, Mark <javamark@gmail.com> wrote:
>>> >>
>>> >>>
>>> >>> "Create the SID from the existing doc" implies that a document already
>>> >>> exists that you wish to add fields to.
>>> >>>
>>> >>> However if the document is a binary are you suggesting
>>> >>>
>>> >>> 1) curl to upload/extract passing docID
>>> >>> 2) obtain a SID based off docID
>>> >>> 3) add addtinal fields to SID & commit
>>> >>>
>>> >>> I know I'm possibly wandering into the schemaless teritory here
as
>>> well
>>> >>>
>>> >>>
>>> >>> On 28 January 2015 at 17:11, Andrew Pawloski <apawloski@gmail.com>
>>> wrote:
>>> >>>
>>> >>>> I would switch the order of those. Add the new fields and *then*
>>> index to
>>> >>>> solr.
>>> >>>>
>>> >>>> We do something similar when we create SolrInputDocuments that
are
>>> pushed
>>> >>>> to solr. Create the SID from the existing doc, add any additional
>>> fields,
>>> >>>> then add to solr.
>>> >>>>
>>> >>>> On Wed, Jan 28, 2015 at 11:56 AM, Mark <javamark@gmail.com>
wrote:
>>> >>>>
>>> >>>> > Is it possible to use curl to upload a document (for extract
&
>>> >>>> indexing)
>>> >>>> > and specify some fields on the fly?
>>> >>>> >
>>> >>>> > sort of:
>>> >>>> > 1) index this document
>>> >>>> > 2) by the way here are some important facets whilst your
at it
>>> >>>> >
>>> >>>> > Regards
>>> >>>> >
>>> >>>> > Mark
>>> >>>> >
>>> >>>>
>>> >>>
>>> >>>
>>> >>
>>>
>>
>>

Mime
View raw message