lucene-solr-user mailing list archives

From: Jan Høydahl <jan....@cominvent.com>
Subject: Re: URL search and indexing
Date: Tue, 25 Jun 2013 10:28:25 GMT

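This is probably a good match for Solr's regular expression queries, provided your url field is not tokenized,
e.g. q=url:/.*\.it/ (a Lucene regexp is matched against the whole term, so no trailing $ anchor is needed)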
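
A minimal sketch of how such a request could be built from a client, assuming a local Solr at port 8983 with a core named collection1 (both are placeholders, not taken from this thread) and a non-tokenized url field. The pattern is written without a trailing $, on the understanding that Lucene matches a regexp against the entire term. Python's re.fullmatch is used here only to imitate that whole-term behaviour locally; it is not the Lucene regexp engine.

```python
import re
from urllib.parse import urlencode

# Hypothetical endpoint: host, port and core name are placeholders.
SOLR_SELECT = "http://localhost:8983/solr/collection1/select"

def regexp_query_url(field, pattern):
    """Build a select URL carrying a /regexp/ query on a single field."""
    params = {"q": f"{field}:/{pattern}/", "wt": "json"}
    return f"{SOLR_SELECT}?{urlencode(params)}"

def matches_whole_term(pattern, term):
    """Approximate whole-term matching locally with Python's re module."""
    return re.fullmatch(pattern, term) is not None

pattern = r".*\.it"
print(regexp_query_url("url", pattern))
print(matches_whole_term(pattern, "www.okkam.it"))        # term ends in .it
print(matches_whole_term(pattern, "www.okkam.it/about"))  # term has a trailing path
```

The last check is the caveat to keep in mind: if the indexed value carries a path after the host, it no longer ends in .it, so the pattern would have to be extended to allow for the path.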

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com

On 25 June 2013 at 12:17, Flavio Pompermaier <pompermaier@okkam.it> wrote:

> Hi everybody,
> I'm quite new to Solr, so maybe my question is trivial for you.
> In my use case I have to index content found at some URLs, so I use the URL as
> the key of my document and treat it as a string.
> 
> However, I'd like to be able to query by domain name, like *.it or
> *.somesite.com. What's the best strategy? I thought of applying a URL-to-path
> transformation and indexing the result with solr.PathHierarchyTokenizerFactory, but
> maybe there's a simpler solution, isn't there?
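
One way to picture the URL-to-path idea, as a rough sketch only: reverse the host labels so the top-level domain comes first, then let a path-hierarchy tokenizer emit every prefix as a token. The helper names host_to_path and path_prefixes are made up for illustration, and path_prefixes only imitates what solr.PathHierarchyTokenizerFactory with "/" as delimiter would be expected to produce; it is not Solr code.

```python
from urllib.parse import urlsplit

def host_to_path(url):
    """Turn 'http://www.okkam.it/x' into 'it/okkam/www' (reversed host labels)."""
    host = urlsplit(url).hostname or ""
    return "/".join(reversed(host.split("."))) if host else ""

def path_prefixes(path, delimiter="/"):
    """List every leading prefix of a delimited path, shortest first."""
    parts = path.split(delimiter) if path else []
    return [delimiter.join(parts[: i + 1]) for i in range(len(parts))]

path = host_to_path("http://www.okkam.it/some/page")
print(path)                 # it/okkam/www
print(path_prefixes(path))  # ['it', 'it/okkam', 'it/okkam/www']
```

Under these assumptions, a query for *.it becomes an exact match on the token it, and *.somesite.com becomes an exact match on com/somesite, with no wildcard or regexp needed at query time.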
> 
> Best,
> Flavio
> 
> -- 
> 
> Flavio Pompermaier
