lucene-dev mailing list archives

From Marvin Humphrey
Subject Test corpus
Date Sat, 01 Apr 2006 23:54:16 GMT

I'm looking for a test corpus to use for some benchmarking and  
parsing tests.  I can whip one up myself, but it would be nice to use  
something standardized.  I'd like something that doesn't require a  
license/fee, so that other people can run the same tests.  At least  
1000 docs, a few hundred words each.  Any suggestions?

Marvin Humphrey
Rectangular Research
