nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "luoleicn@gmail.com" <luole...@gmail.com>
Subject Patch about get url which contains Chinese words
Date Tue, 11 Jan 2011 00:49:27 GMT
Hi guys:

The urls of some files on the internet may contains Chinese or other
unicode words. For example

http://www.example.com/中文.pdf

But nutch can't encode it well. So I give this patch using URL using
URLEncoder to encode urls correctly.


罗磊

Mime
View raw message