nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nadav Hashimshony" <>
Subject Cant run twice get in SegmentReader
Date Wed, 06 Feb 2008 08:56:16 GMT

i am using the the SegmentReader object.

i am trying to run the following code

SegmentReader sgm = new SegmentReader(conf,true,true,true,true,true,true);

sgm.get(new Path(input),new Text(URL1),
                            new OutputStreamWriter(System.out, "UTF-8"),
                            new HashMap());

                    sgm.get(new Path(input),new Text(URL2).toString()),
                            new OutputStreamWriter(System.out, "UTF-8"),
                            new HashMap());

Where the URL1 and URL2 are valid urls in the nutch DB, i used readseg
option in the bin/nutch to view them.
when i run the code in the debugger i see data for the first get run,
in the second get run i see no data.

Any idea why i cant run the get method twice??


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message