nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Nutch Wiki] Update of "RunningNutchAndSolr" by Dmitrius
Date Mon, 10 May 2010 12:08:34 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "RunningNutchAndSolr" page has been changed by Dmitrius.


  The above command will generate a new segment directory under crawl/segments that at this
point contains files that store the url(s) to be fetched. In the following commands we need
the latest segment dir as parameter so we’ll store it in an environment variable:
- export SEGMENT=crawl/segments/`ls -tr crawl/segments|tail -1`
+ export SEGMENT=crawl/segments/&#96;ls -tr crawl/segments|tail -1&#96;
  Now I launch the fetcher that actually goes to get the content:

View raw message