lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Isha Garg <isha.g...@orkash.com>
Subject Re: Carrot2 clustering component
Date Tue, 18 Jan 2011 06:54:39 GMT
On Tuesday 18 January 2011 11:12 AM, Otis Gospodnetic wrote:
> Isha,
>
> You'll get more and better help if you provide more details about what you have
> done, what you have tried, what isn't working, what errors or behaviour you are
> seeing, etc.
>
> Otis
> ----
> Sematext ::http://sematext.com/  :: Solr - Lucene - Nutch
> Lucene ecosystem search ::http://search-lucene.com/
>
>
>
> ----- Original Message ----
>    
>> From: Isha Garg<isha.garg@orkash.com>
>> To:solr-user@lucene.apache.org
>> Sent: Tue, January 18, 2011 12:38:03 AM
>> Subject: Carrot2 clustering component
>>
>> Hi,
>>     I am not able to understand the caarot2 clustering component  from
>>
>> http://wiki.apache.org/solr/ClusteringComponent
>>
>> please  provide me more detailed information if someone had already worked on
>> this. How  to run this and use this during search  query.
>>
>>
>> Thanks!
>> Isha
>>
>>
>>
>>
>>      
I had downloaded some jar files compatible with solr1.4  including:
carrot2-core-3.4.2.jar
guava-r05.jar
hppc-0.3.1.jar
jackson-core-asl-1.5.2.jar
mahout-collections-0.3.jar
jackson-mapper-asl-1.5.2.jar
log4j-1.2.14.jar
mahout-math-0.3.jar
simple-xml-2.3.5.jar
And placed them at contrib/clustering/lib

Then changed the solr.config as:

<requestHandler name="standard" default="true">
  <!-- default values for query parameters -->
  <lst name="defaults">
  <str name="echoParams">explicit</str>
  <!--
  <int name="rows">10</int>
  <str name="fl">*</str>
  <str name="version">2.1</str>
  -->
  <!--<bool name="clustering">true</bool>-->
  <str name="clustering.engine">default</str>
  <bool name="clustering.results">true</bool>
  <!-- The title field -->
  <str name="carrot.title">headin</str>
  <str name="carrot.url">id</str>
  <!-- The field to cluster on -->
  <str name="carrot.snippet">text</str>
  <!-- produce summaries -->
  <bool name="carrot.produceSummary">true</bool>
  <!-- the maximum number of labels per cluster -->
  <!--<int name="carrot.numDescriptions">5</int>-->
  <!-- produce sub clusters -->
  <bool name="carrot.outputSubClusters">false</bool>

  </lst>
  <arr name="last-components">
  <str>clustering</str>
  </arr>
  </requestHandler>

  <searchComponent name="clustering">
  <!-- Declare an engine -->
  <lst name="engine">
  <!-- The name, only one can be named "default" -->
  <str name="name">default</str>

  <str name="carrot.algorithm">org.carrot2.clustering.lingo.LingoClusteringAlgorithm</str>
  <str name="LingoClusteringAlgorithm.desiredClusterCountBase">20</str>
  </lst>
  <lst name="engine">
  <str name="name">stc</str>
  <str name="carrot.algorithm">org.carrot2.clustering.stc.STCClusteringAlgorithm</str>
  </lst>
  </searchComponent>


And then run solr using command:
java -Dsolr.clustering.enabled=true -jar start.jar


Now can you tell me where i am wrong ?????????? what else should i do?????????



Mime
View raw message