nutch-dev mailing list archives

From Frank McCown <fmcc...@harding.edu>
Subject Re: Support for Sitemap Protocol and Canonical URLs
Date Wed, 20 May 2009 21:05:25 GMT
My class ran out of time before we could integrate our project into
Nutch, but our Sitemap parser is available for anyone who would like
to integrate it with Nutch:

Java Sitemap Parser
http://sourceforge.net/projects/sitemap-parser/

-- 
Frank McCown, Ph.D.
Assistant Professor of Computer Science
Harding University
http://www.harding.edu/fmccown/
