nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "" <>
Subject Patch about get url which contains Chinese words
Date Tue, 11 Jan 2011 00:49:27 GMT
Hi guys:

The urls of some files on the internet may contains Chinese or other
unicode words. For example中文.pdf

But nutch can't encode it well. So I give this patch using URL using
URLEncoder to encode urls correctly.


View raw message