lucene-dev mailing list archives

From "Shai Erera (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (LUCENE-5599) HttpReplicator uses a lot of CPU for large files
Date Wed, 16 Apr 2014 14:44:21 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shai Erera resolved LUCENE-5599.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 5.0
                   4.9
         Assignee: Shai Erera

Thanks Christoph, this is really a silly bug, nice catch!

I've committed to trunk and 4x.

> HttpReplicator uses a lot of CPU for large files
> ------------------------------------------------
>
>                 Key: LUCENE-5599
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5599
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/replicator
>    Affects Versions: 4.7.1
>            Reporter: Christoph Kaser
>            Assignee: Shai Erera
>            Priority: Minor
>             Fix For: 4.9, 5.0
>
>         Attachments: HttpClientBase.java.patch
>
>
> The method responseInputStream of HttpClientBase wraps an InputStream in order to close it when it is done reading. However, the wrapper only overrides the single-byte read() method; every other method is inherited from its parent (java.io.InputStream). Therefore, the more efficient read methods such as read(byte[] b) are all implemented by reading one byte after the other.
> In my test, it took 20 minutes to copy an index of 38 GB. With the provided small patch, this was reduced to less than 10 minutes.
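
The effect described in the report can be reproduced with a small, self-contained sketch. This is not the attached HttpClientBase.java.patch, whose contents are not shown in this message; the names AutoCloseStreamSketch, CountingStream, slowWrapper, fastWrapper and drain are hypothetical and exist only for illustration. It assumes a wrapper that closes the underlying stream at end of input, and compares a version that overrides only read() with one that also overrides read(byte[], int, int).

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical illustration only; not the code from HttpClientBase or the patch.
final class AutoCloseStreamSketch {

    /** Underlying stream that counts how it is read. */
    static class CountingStream extends ByteArrayInputStream {
        int singleByteReads = 0;
        int bulkReads = 0;

        CountingStream(byte[] data) {
            super(data);
        }

        @Override
        public synchronized int read() {
            singleByteReads++;
            return super.read();
        }

        @Override
        public synchronized int read(byte[] b, int off, int len) {
            bulkReads++;
            return super.read(b, off, len);
        }
    }

    /** Overrides only read(): bulk reads fall back to InputStream's byte-by-byte loop. */
    static InputStream slowWrapper(final InputStream in) {
        return new InputStream() {
            @Override
            public int read() throws IOException {
                int b = in.read();
                if (b == -1) {
                    in.close(); // close the wrapped stream at end of input
                }
                return b;
            }
        };
    }

    /** Also overrides the bulk read, so whole buffers are passed to the wrapped stream. */
    static InputStream fastWrapper(final InputStream in) {
        return new InputStream() {
            @Override
            public int read() throws IOException {
                int b = in.read();
                if (b == -1) {
                    in.close();
                }
                return b;
            }

            @Override
            public int read(byte[] b, int off, int len) throws IOException {
                int n = in.read(b, off, len);
                if (n == -1) {
                    in.close();
                }
                return n;
            }

            @Override
            public void close() throws IOException {
                in.close();
            }
        };
    }

    /** Reads the stream to the end through a 256-byte buffer and returns the byte count. */
    static long drain(InputStream in) {
        byte[] buf = new byte[256];
        long total = 0;
        try {
            int n;
            while ((n = in.read(buf)) != -1) {
                total += n;
            }
        } catch (IOException e) {
            // Rethrown unchecked to keep the sketch's call sites short.
            throw new UncheckedIOException(e);
        }
        return total;
    }

    public static void main(String[] args) {
        CountingStream slow = new CountingStream(new byte[1000]);
        drain(slowWrapper(slow));
        System.out.println("read() only:  single=" + slow.singleByteReads + " bulk=" + slow.bulkReads);

        CountingStream fast = new CountingStream(new byte[1000]);
        drain(fastWrapper(fast));
        System.out.println("bulk override: single=" + fast.singleByteReads + " bulk=" + fast.bulkReads);
    }
}
```

With the read()-only wrapper, every buffer the caller asks for is filled by repeated single-byte calls on the wrapped stream, which is the per-byte overhead the report attributes the high CPU usage to. Delegating read(byte[], int, int) lets the wrapped stream fill the buffer in one call.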



--
This message was sent by Atlassian JIRA
(v6.2#6252)
