lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <>
Subject Re: types of formats who support Lucene?
Date Thu, 02 Dec 2004 13:29:55 GMT
Lucene itself indexes java.lang.String or data.  It is 
completely up to your application to parse the data out of whatever 
source it is in and hand it to Lucene.  There are a number of 
open-source libraries that make parsing XML, MS Word, Excel, HTML, and 
other formats trivial.  If you search the e-mail list archives you'll 
find pointers to tons of options.


On Dec 2, 2004, at 7:36 AM, Daniel Cortes wrote:

> Hi I''m newer in this mail list and what you can see my English is 
> very terrible.
> I 'm having a study to select the best technology  for a motor 
> serching of an application web with a ratio of 1000 users/day.
> I  read a little bit of Lucene what I don't know what file types 
> support the search.
> If you can reply my or say me a page that tells this I regret you.
> Thanks of a "novatillo"
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message