lucene-solr-user mailing list archives

From Chris Hostetter <hossman_luc...@fucit.org>
Subject Re: UUIDUpdateProcessorFactory causes repeated documents when uploading csv files?
Date Thu, 08 Jan 2015 01:47:08 GMT

: It's a single Solr Instance, and in my files, I used 'doc_key' everywhere,
: but I changed it to "id" in the email I sent out wanting to make it easier
: to read, sorry don't mean to confuse you :)

https://wiki.apache.org/solr/UsingMailingLists

- what version of solr?
- how exactly are you doing the update? curl? post.jar?
- what exactly is the HTTP response from your update?
- what does your log file show during the update?
- what exactly do all of your configs look like? (you said you made a 
mistake in your email by trying to make the data "easier to read" -- that 
could easily be masking some other mistake in your actual configs)

I did my best to try and reproduce what you describe, but i had no 
luck -- here's exactly what i did...


hossman@frisbee:~/lucene/lucene-4.10.3_tag$ svn diff
Index: solr/example/solr/collection1/conf/solrconfig.xml
===================================================================
--- solr/example/solr/collection1/conf/solrconfig.xml	(revision 1650199)
+++ solr/example/solr/collection1/conf/solrconfig.xml	(working copy)
@@ -1076,7 +1076,17 @@
          <str name="update.chain">dedupe</str>
        </lst>
        -->
+    <lst name="defaults">
+      <str name="update.chain">autoGenId</str>
+    </lst>
   </requestHandler>
+  <updateRequestProcessorChain name="autoGenId">
+    <processor class="solr.UUIDUpdateProcessorFactory">
+      <str name="fieldName">id</str>
+    </processor>
+    <processor class="solr.LogUpdateProcessorFactory" />
+    <processor class="solr.RunUpdateProcessorFactory" />
+  </updateRequestProcessorChain>
 
   <!-- The following are implicitly added
   <requestHandler name="/update/json" class="solr.UpdateRequestHandler">
hossman@frisbee:~/lucene/lucene-4.10.3_tag$ curl -X POST 'http://localhost:8983/solr/collection1/update?commit=true' \
  -H "Content-Type: application/csv" --data-binary 'foo_s,bar_s
aaa,cat
bbb,dog
ccc,yak
'
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int name="QTime">350</int></lst>
</response>
hossman@frisbee:~/lucene/lucene-4.10.3_tag$ curl 'http://localhost:8983/solr/collection1/select?q=*:*&wt=json&indent=true'
{
  "responseHeader":{
    "status":0,
    "QTime":7,
    "params":{
      "indent":"true",
      "q":"*:*",
      "wt":"json"}},
  "response":{"numFound":3,"start":0,"docs":[
      {
        "foo_s":"aaa",
        "bar_s":"cat",
        "id":"025c69cd-6407-4c70-903b-dfde170d373b",
        "_version_":1489692576651935744},
      {
        "foo_s":"bbb",
        "bar_s":"dog",
        "id":"5c7b3d65-1274-4bad-a671-4d643531e2ae",
        "_version_":1489692576673955840},
      {
        "foo_s":"ccc",
        "bar_s":"yak",
        "id":"25a3893f-c538-4b47-aa79-1f4268d66c39",
        "_version_":1489692576673955841}]
  }}
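
If you want to check your own index for the repeated documents you describe, a rough sketch like the one below might help. It takes the parsed JSON from a /select request like the one above and groups documents by every field except the generated ones ("id" and "_version_"), so any group with more than one id is a row that was indexed more than once. The function name and the choice of which fields to ignore are my own assumptions for illustration, not anything Solr provides; if your key field is really 'doc_key', swap that in.

```python
import json
from collections import defaultdict

# Assumption: these are the only fields generated at index time
# (the UUID field from the update chain, plus Solr's _version_).
GENERATED_FIELDS = frozenset({"id", "_version_"})


def find_repeated_docs(select_response, ignore=GENERATED_FIELDS, id_field="id"):
    """Return {content_key: [ids]} for docs whose non-generated fields are identical."""
    groups = defaultdict(list)
    for doc in select_response["response"]["docs"]:
        # Build a hashable key from the remaining fields; json.dumps makes
        # multi-valued (list) fields hashable as well.
        key = tuple(sorted(
            (name, json.dumps(value, sort_keys=True))
            for name, value in doc.items()
            if name not in ignore
        ))
        groups[key].append(doc.get(id_field))
    # Keep only the groups that contain more than one document.
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


# The three documents from the /select output above.
sample = {
    "response": {"numFound": 3, "start": 0, "docs": [
        {"foo_s": "aaa", "bar_s": "cat",
         "id": "025c69cd-6407-4c70-903b-dfde170d373b",
         "_version_": 1489692576651935744},
        {"foo_s": "bbb", "bar_s": "dog",
         "id": "5c7b3d65-1274-4bad-a671-4d643531e2ae",
         "_version_": 1489692576673955840},
        {"foo_s": "ccc", "bar_s": "yak",
         "id": "25a3893f-c538-4b47-aa79-1f4268d66c39",
         "_version_": 1489692576673955841},
    ]}
}

repeated = find_repeated_docs(sample)
print(len(repeated))  # 0: no row was indexed twice in the run above
```

Note that this only looks at the docs in the response you hand it, so for a larger index you would need to raise "rows" (or page through the results) before trusting an empty result.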







-Hoss
http://www.lucidworks.com/
