nutch-dev mailing list archives

From Tobias Zahn <Tobias-Z...@arcor.de>
Subject 'RegexIndexingFilter'
Date Mon, 29 Jan 2007 18:57:47 GMT
Good evening!
I have found that it is not possible to index only certain file
types with Nutch. Since I need this feature, I am thinking of implementing a
'RegexIndexingFilter', if that is the right approach.
I have read some of the source code, but I could not find out how to tell the
indexer that it should not index a given file.
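
To make the idea concrete, here is a minimal sketch of the decision logic only, written in plain Java without any Nutch classes. The class name, the method name and the example pattern are all hypothetical, and how the result would be passed on to the indexer is exactly the part I have not figured out.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Nutch: decides from the URL alone
// whether a document should be indexed.
final class RegexIndexingDecision {
    private final Pattern allowed;

    RegexIndexingDecision(String regex) {
        // Case-insensitive so that ".PDF" and ".pdf" are treated alike.
        this.allowed = Pattern.compile(regex, Pattern.CASE_INSENSITIVE);
    }

    // Returns true if the configured pattern matches somewhere in the URL.
    boolean shouldIndex(String url) {
        return url != null && allowed.matcher(url).find();
    }
}
```

For example, a pattern such as "\\.(pdf|doc)$" would accept URLs ending in .pdf or .doc and reject everything else. Matching on the URL is only one option; matching on the content type might be more reliable, but that is an assumption on my part.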

I hope I am on the right track, and I would appreciate your opinions,
ideas and help.

TIA,
Tobias Zahn
