lucene-solr-user mailing list archives

From "Tim Archambault" <tim.j.archamba...@gmail.com>
Subject Re: solr instances for different content?
Date Mon, 05 Nov 2007 15:27:29 GMT
Good points Grant. I'm envisioning my front end working so that a user would
never be able to search across all the verticals at once.

EVERY query would inject "vertical:jobs", "vertical:news",
"vertical:Autos", etc.

This may detrimentally affect my faceted result sets, so I'll have to think
about this more.
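
A rough sketch of what I mean by injecting the vertical into every query,
assuming the front end builds the Solr request URL itself. The base URL, the
helper name, the lowercase vertical values and the facet field are all
placeholders of mine, not anything from an actual setup. It uses Solr's "fq"
(filter query) parameter rather than appending the clause to "q":

```python
from urllib.parse import urlencode

# Hypothetical vertical names, loosely based on the list in this thread.
VERTICALS = {"news", "jobs", "homes", "autos", "classifieds", "community"}


def build_select_url(user_query, vertical, facet_field=None,
                     base_url="http://localhost:8983/solr/select"):
    """Build a Solr select URL that is always restricted to one vertical.

    base_url is an assumed local endpoint; adjust for a real deployment.
    """
    if vertical not in VERTICALS:
        raise ValueError("unknown vertical: %r" % (vertical,))

    params = [
        ("q", user_query),                  # what the user typed
        ("fq", "vertical:%s" % vertical),   # filter injected on every request
    ]
    if facet_field is not None:
        # Facet counts are computed over documents matching q AND fq,
        # so they only reflect the selected vertical.
        params.append(("facet", "true"))
        params.append(("facet.field", facet_field))

    return "%s?%s" % (base_url, urlencode(params))


if __name__ == "__main__":
    print(build_select_url("software engineer", "jobs", facet_field="city"))
```

As far as I understand it, a filter query only restricts which documents can
match and does not contribute to the score. Term statistics such as document
frequency are still taken from the whole index, so this may not fully isolate
ranking between verticals.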

Wouldn't this approach overcome my relevancy and scoring issues?

On 11/5/07, Grant Ingersoll <gsingers@apache.org> wrote:
>
> One reason to consider separate indexes is in terms of relevance.  Do
> you want content from classifieds affecting the rankings of your news
> searches?  May not be an issue for you depending on your term
> distributions, but might be something to consider.  As you suspect,
> though, having multiple indexes will require more management of the
> various instances.  Perhaps you can logically group things to only
> have a couple of indexes?  For instance, maybe home, auto, classifieds
> are similar in content and structure and news and community-generated
> content are similar?
>
> -Grant
>
> On Nov 5, 2007, at 9:34 AM, Tim Archambault wrote:
>
> > Typical newspaper site with: news, jobs, homes, autos, classifieds,
> > community-generated content, guesstimate of .5 million documents
> >
> > Do I really need to create a different solr index for each vertical? How
> > inefficient is it to add a few additional fields for each content type?
> >
> > Thinking of having a string field named "vertical" that would be used to
> > segment by the verticals above.
> >
> > My intuition is that most of the additional fields would be numbers:
> > integers, prices, decimals.
> >
> > Thanks,
> >
> > Tim
> >
> > --
> > True innovation is not just about changing a product, a service or even a
> > marketplace; its also about recognizing and relishing the need to change
> > yourself.
>
> --------------------------
> Grant Ingersoll
> http://lucene.grantingersoll.com
>
> Lucene Boot Camp Training:
> ApacheCon Atlanta, Nov. 12, 2007.  Sign up now!  http://www.apachecon.com
>
> Lucene Helpful Hints:
> http://wiki.apache.org/lucene-java/BasicsOfPerformance
> http://wiki.apache.org/lucene-java/LuceneFAQ
>
>
>


-- 
True innovation is not just about changing a product, a service or even a
marketplace; its also about recognizing and relishing the need to change
yourself.
