lucene-solr-user mailing list archives

From "James liu" <liuping.ja...@gmail.com>
Subject Re: i wanna find one crawl that can crawl with defined urls and defined data
Date Tue, 01 May 2007 12:36:35 GMT
2007/4/30, Graeme Merrall <dasfreak@gmail.com>:
>
> > i wanna crawl http://www.amazone.com/  and just wanna product title ,
> > product information, writer, publisher.
> >
> > and other data i wanna ignore.
>
> How about
> http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html



I had already read it before sending this mail.


for example,
>
> i wanna crawl http://www.amazone.com/  and just wanna product title ,
> product information, writer, publisher.
>
> and other data i wanna ignore.
>
>
or if you're prepared to wait or help out there's
> http://svn.apache.org/repos/asf/labs/droids/README.TXT
>



-- 
regards
jl
