lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shalin Shekhar Mangar (JIRA)" <>
Subject [jira] [Created] (SOLR-8582) /update/json/docs is 4x slower than /update for indexing a list of json docs
Date Fri, 22 Jan 2016 11:16:39 GMT
Shalin Shekhar Mangar created SOLR-8582:

             Summary: /update/json/docs is 4x slower than /update for indexing a list of json
                 Key: SOLR-8582
             Project: Solr
          Issue Type: Bug
          Components: update
            Reporter: Shalin Shekhar Mangar
             Fix For: 5.5, Trunk

Indexing a ~650 MB json file containing a list of 2.2 million json documents, I found that
bin/post had become 4x slower after SOLR-7042. Memory consumption has also gone up and I can
no longer index this file with a 512mb heap.

The difference is because we now default to /update/json/docs instead of /update. This can
be verified on trunk:
time curl 'http://localhost:8983/solr/gettingstarted/update' --data-binary @/hdd/solr-data/imdb.json

real	2m42.044s
user	0m0.292s
sys	0m0.493s
time curl 'http://localhost:8983/solr/gettingstarted/update/json/docs' --data-binary @/hdd/solr-data/imdb.json

real	11m26.478s
user	0m0.324s
sys	0m0.552s

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message