lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael McCandless <>
Subject Re: Anticipating a benchmark for direct posting format
Date Thu, 03 Apr 2014 15:37:44 GMT
Is the benchmark just trying to measure speedups by using DirectPF vs
the default PF?  You could do this today w/ luceneutil (using
Wikipedia as content).

But if you have another content source / index, I'm happy to run the
benchmark.  It'd be easier to make the content available (CSV, or line
docs file format), then ship around big indices ...

I have a box with 48 GB RAM.

Mike McCandless

On Thu, Apr 3, 2014 at 8:38 AM, Benson Margulies <> wrote:
> Some of you may recall that I started a thread some time ago about
> wishing for the benefits of the direct posting format without needing
> to use a codec. The thread landed as a challenge: show a benchmark of
> the benefit of D-P-F.
> After a lot of distraction, I'm now in a position to build it. The
> core is a rather large index, and to show the effect (always assuming
> that I succeed) will take a machine with a large amount of RAM.
> One approach is for me to simply build the index involved and make it
> available as an index. Another would be to side-step into a giant pile
> of  CSV or JSON and provide a do-it-yourself kit.
> Anyone have a preference?
> What have we got for hardware with, 40G of RAM? Anything, or will this
> be up to individuals to try out on dayjob hardware?
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message