Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sun, 02 Dec, 23:00 |
Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Tue, 04 Dec, 13:19 |
Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
Wed, 26 Dec, 18:08 |
Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Sun, 30 Dec, 11:16 |
Andrea Spinelli (JIRA) |
[jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Tue, 04 Dec, 11:15 |
Andrzej Bialecki |
Re: Filter spam URLs |
Fri, 07 Dec, 13:51 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Mon, 03 Dec, 23:29 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file |
Tue, 04 Dec, 11:51 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release |
Tue, 11 Dec, 01:08 |
Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-586) Add option to run compiled classes w/o job file |
Mon, 17 Dec, 18:24 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format |
Thu, 27 Dec, 11:51 |
Andrzej Bialecki (JIRA) |
[jira] Created: (NUTCH-595) "Target file:/.... already exists" |
Thu, 27 Dec, 13:08 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-534) SegmentMerger: add -normalize option |
Thu, 27 Dec, 13:36 |
Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Mon, 03 Dec, 20:20 |
Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Mon, 03 Dec, 20:26 |
Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release |
Mon, 03 Dec, 23:35 |
Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release |
Mon, 03 Dec, 23:39 |
Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Mon, 03 Dec, 23:43 |
Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Tue, 04 Dec, 02:03 |
Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Tue, 04 Dec, 19:15 |
Dennis Kubes (JIRA) |
[jira] Commented: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release |
Mon, 10 Dec, 23:46 |
Dennis Kubes (JIRA) |
[jira] Created: (NUTCH-594) Serve Nutch search results in XML and JSON |
Fri, 21 Dec, 17:10 |
Dennis Kubes (JIRA) |
[jira] Updated: (NUTCH-594) Serve Nutch search results in XML and JSON |
Fri, 21 Dec, 17:18 |
Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Sun, 16 Dec, 15:23 |
Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED |
Sun, 16 Dec, 15:23 |
Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format |
Thu, 27 Dec, 10:30 |
Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-595) "Target file:/.... already exists" |
Thu, 27 Dec, 13:21 |
Emmanuel Joke (JIRA) |
[jira] Commented: (NUTCH-534) SegmentMerger: add -normalize option |
Thu, 27 Dec, 13:26 |
Emmanuel Joke (JIRA) |
[jira] Updated: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format |
Fri, 28 Dec, 02:59 |
Emmanuel Joke (JIRA) |
[jira] Created: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Sun, 30 Dec, 09:52 |
Enis Soztutar (JIRA) |
[jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file |
Tue, 04 Dec, 09:59 |
Enis Soztutar (JIRA) |
[jira] Updated: (NUTCH-586) Add option to run compiled classes w/o job file |
Tue, 04 Dec, 13:21 |
Enis Soztutar (JIRA) |
[jira] Resolved: (NUTCH-588) Help Need |
Fri, 07 Dec, 10:46 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly |
Wed, 05 Dec, 05:47 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file |
Tue, 18 Dec, 04:20 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null |
Tue, 25 Dec, 04:19 |
Joseph Chen (JIRA) |
[jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest |
Tue, 18 Dec, 23:34 |
Les Cheong (JIRA) |
[jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server |
Wed, 12 Dec, 19:34 |
Lirida Kercelli |
scoring algorithm |
Sun, 23 Dec, 14:00 |
Matt Kangas (JIRA) |
[jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Tue, 04 Dec, 21:43 |
Nathaniel Powell (JIRA) |
[jira] Created: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point |
Thu, 06 Dec, 00:59 |
Ned Rockson |
Task process exit with nonzero status of 65 |
Mon, 03 Dec, 21:36 |
Ned Rockson |
Filter spam URLs |
Fri, 07 Dec, 01:14 |
Neumann, Vladimir |
cached.jsp for the new dev-version |
Thu, 13 Dec, 10:24 |
Nigel Daley |
Hudson Upgrade Dec 19 |
Wed, 19 Dec, 06:59 |
Nigel Daley |
Re: Hudson Upgrade Dec 19 |
Thu, 20 Dec, 19:45 |
Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Sun, 02 Dec, 17:26 |
Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sun, 02 Dec, 21:36 |
Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 03 Dec, 06:45 |
Peter Boot |
errors compiling index-extra |
Fri, 21 Dec, 04:25 |
Peter Boot (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Fri, 21 Dec, 21:17 |
Remco Verhoef (JIRA) |
[jira] Created: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. |
Sun, 30 Dec, 16:29 |
Remco Verhoef (JIRA) |
[jira] Updated: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. |
Sun, 30 Dec, 16:31 |
Ryan Levering (JIRA) |
[jira] Created: (NUTCH-589) Hierarchical Classloaders |
Wed, 05 Dec, 00:04 |
Teccon Ingenieros (JIRA) |
[jira] Created: (NUTCH-588) Help Need |
Tue, 04 Dec, 16:40 |
Torontoer |
Enable Nutch to search for local file system |
Mon, 24 Dec, 03:33 |
Trey Spiva |
Re: Image Search Engine Input |
Sun, 02 Dec, 02:30 |
frank ling (JIRA) |
[jira] Created: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. |
Fri, 14 Dec, 00:47 |
hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #307 |
Fri, 28 Dec, 05:08 |
hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #308 |
Sat, 29 Dec, 04:10 |
hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #309 |
Sat, 29 Dec, 05:33 |
hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #311 |
Mon, 31 Dec, 04:34 |
lv david |
Re: some question about development |
Sat, 01 Dec, 11:20 |
novikov1 |
cached.jsp for the new dev-version |
Thu, 13 Dec, 10:59 |
patil |
fnm frq like files are not creating while crawling some site |
Wed, 12 Dec, 09:59 |
patil |
files are not generated in index folder by indexer for the site http://www.traguiden.se (for other sites its working good) while crawling |
Fri, 14 Dec, 06:25 |
quxy |
Nutch\nutch-0.9\build.xml:61: Specify at least one source--a file or resource collection. |
Wed, 05 Dec, 04:06 |
sudarat (JIRA) |
[jira] Created: (NUTCH-593) Nutch crawl problem |
Wed, 19 Dec, 02:49 |