lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amit Sela <am...@infolinks.com>
Subject Re: Solr admin search with wildcard
Date Thu, 27 Jun 2013 13:41:09 GMT
Forgive my ignorance but I want to  be sure, do I add <copyField
source="iframe" dest="text"/> to solrindex-mapping.xml?
so that my solrindex-mapping.xml looks like this:
<fields>
                <field dest="content" source="content"/>
                <field dest="title" source="title"/>
                <field dest="iframe" source="iframe"/>
                <field dest="host" source="host"/>
                <field dest="segment" source="segment"/>
                <field dest="boost" source="boost"/>
                <field dest="digest" source="digest"/>
                <field dest="tstamp" source="tstamp"/>
                <field dest="id" source="url"/>
                <copyField source="url" dest="url"/>
                *<copyField source="iframe" dest="text"/> *
        </fields>
        <uniqueKey>url</uniqueKey>

And what do you mean by standard tokenization ?

Thanks!


On Thu, Jun 27, 2013 at 3:43 PM, Jack Krupansky <jack@basetechnology.com>wrote:

> Just <copyField> from the string field to a "text" field and use standard
> tokenization, then you can search the text field for "youtube" or even
> "something" that is a component of the URL path. No wildcard required.
>
>
> -- Jack Krupansky
>
> -----Original Message----- From: Amit Sela
> Sent: Thursday, June 27, 2013 8:37 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Solr admin search with wildcard
>
>
> The stored and indexed string is actually a url like "
> http://www.youtube.com/**somethingsomething<http://www.youtube.com/somethingsomething>
> ".
> It looks like removing the quotes does the job: iframe:*youtube* or am I
> wrong ? For now, performance is not an issue, but accuracy is and I would
> like to know for example how many URLS have iframe source leading to
> YouTube for example. So query like: iframe:*youtube* with max rows 10 or
> something will return in the response numFound field the total number of
> pages that have a tag ifarme with a source matching *youtube, No ?
>
>
> On Thu, Jun 27, 2013 at 3:24 PM, Jack Krupansky <jack@basetechnology.com>*
> *wrote:
>
>  No, you cannot use wildcards within a quoted term.
>>
>> Tell us a little more about what your strings look like. You might want to
>> consider tokenizing or using ngrams to avoid the need for wildcards.
>>
>> -- Jack Krupansky
>>
>> -----Original Message----- From: Amit Sela
>> Sent: Thursday, June 27, 2013 3:33 AM
>> To: solr-user@lucene.apache.org
>> Subject: Solr admin search with wildcard
>>
>>
>> I'm looking to search (in the solr admin search screen) a certain field
>> for:
>>
>> *youtube*
>>
>> I know that leading wildcards takes a lot of resources but I'm not worried
>> with that
>>
>> My only question is about the syntax, would this work:
>>
>> field:"*youtube*" ?
>>
>> Thanks,
>>
>> I'm using Solr 3.6.2
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message