lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: Files included from the default SolrConfig
Date Wed, 05 Jun 2013 11:52:50 GMT
1. SolrCell (ExtractingRequestHandler) - extract and index content from rich 
documents, such as PDF, Office docs, HTML (uses Tika)
2. Clustering - for result clustering.
3. Language identification (two update processors) - analyzes text of fields 
to determine language code.

None of those is mandatory, which is why they have separate libs.

-- Jack Krupansky

-----Original Message----- 
From: Raheel Hasan
Sent: Wednesday, June 05, 2013 5:57 AM
To: solr-user@lucene.apache.org
Subject: Files included from the default SolrConfig

Hi,

I am trying to optimize solr.

The default solrConfig that comes with solr>collection1 has a lot of libs
included I dont really need. Perhaps if someone could help we identifying
the purpose. (I only import from DIH):

Please tell me whats in these:
contrib/extraction/lib
solr-cell-

contrib/clustering/lib
solr-clustering-

contrib/langid/lib/
solr-langid


-- 
Regards,
Raheel Hasan 


Mime
View raw message