Lukáš Vlček |
Re: My ApacheconNA 2010 slides |
Sun, 07 Nov, 16:13 |
Alex McLintock (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 14:37 |
Alexis (JIRA) |
[jira] Commented: (NUTCH-873) Ivy configuration settings don't include Gora |
Fri, 05 Nov, 19:42 |
Alexis (JIRA) |
[jira] Issue Comment Edited: (NUTCH-873) Ivy configuration settings don't include Gora |
Fri, 05 Nov, 19:48 |
Alexis (JIRA) |
[jira] Issue Comment Edited: (NUTCH-873) Ivy configuration settings don't include Gora |
Fri, 05 Nov, 19:52 |
Alexis (JIRA) |
[jira] Issue Comment Edited: (NUTCH-873) Ivy configuration settings don't include Gora |
Fri, 05 Nov, 19:54 |
Alexis (JIRA) |
[jira] Commented: (NUTCH-880) REST API for Nutch |
Sat, 06 Nov, 00:26 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 04 Nov, 18:46 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 04 Nov, 18:48 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 04 Nov, 20:54 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 04 Nov, 21:06 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-880) REST API for Nutch |
Sat, 06 Nov, 01:32 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Sat, 06 Nov, 01:38 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Fri, 12 Nov, 13:29 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Fri, 12 Nov, 14:57 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-938) Imposible to fetch sites with robots.txt |
Wed, 24 Nov, 22:54 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 25 Nov, 12:04 |
Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON |
Thu, 25 Nov, 12:20 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-938) Imposible to fetch sites with robots.txt |
Thu, 25 Nov, 12:50 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 16:15 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1293 |
Mon, 01 Nov, 04:35 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1294 |
Tue, 02 Nov, 04:03 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1295 |
Wed, 03 Nov, 04:08 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1296 |
Thu, 04 Nov, 04:08 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1297 |
Fri, 05 Nov, 04:05 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1298 |
Sat, 06 Nov, 04:03 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1299 |
Sun, 07 Nov, 04:02 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1300 |
Mon, 08 Nov, 04:02 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1301 |
Tue, 09 Nov, 04:13 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1303 |
Thu, 11 Nov, 06:10 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1306 |
Sun, 14 Nov, 06:10 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1307 |
Mon, 15 Nov, 04:09 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1308 |
Tue, 16 Nov, 04:02 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1309 |
Wed, 17 Nov, 05:52 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1310 |
Thu, 18 Nov, 05:00 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1311 |
Fri, 19 Nov, 04:31 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1312 |
Sat, 20 Nov, 04:42 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1313 |
Sun, 21 Nov, 04:22 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1314 |
Mon, 22 Nov, 04:04 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1315 |
Tue, 23 Nov, 07:34 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1316 |
Wed, 24 Nov, 07:01 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1317 |
Thu, 25 Nov, 10:42 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1318 |
Fri, 26 Nov, 05:55 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1319 |
Sat, 27 Nov, 07:28 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1320 |
Sun, 28 Nov, 04:01 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1321 |
Mon, 29 Nov, 04:09 |
Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #1322 |
Tue, 30 Nov, 04:01 |
Apache Wiki |
[Nutch Wiki] Update of "PublicServers" by search2.net |
Tue, 02 Nov, 05:00 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "PublicServers" by seegnify |
Fri, 05 Nov, 21:35 |
Apache Wiki |
[Nutch Wiki] Update of "GORA_HBase" by Alexis |
Fri, 05 Nov, 22:31 |
Apache Wiki |
[Nutch Wiki] Update of "RunNutchInEclipse" by store88 |
Tue, 09 Nov, 05:59 |
Apache Wiki |
[Nutch Wiki] Update of "store88" by store88 |
Tue, 09 Nov, 06:38 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "PublicServers" by seegnify |
Wed, 10 Nov, 14:21 |
Apache Wiki |
[Nutch Wiki] Update of "PublicServers" by SimaoFontes |
Fri, 12 Nov, 17:25 |
Apache Wiki |
[Nutch Wiki] Update of "FrontPage" by ChrisMattmann |
Sat, 13 Nov, 16:09 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "PublicServers" by seegnify |
Thu, 18 Nov, 14:33 |
Apache Wiki |
[Nutch Wiki] Update of "WritingPluginExample-1.2" by NiccoloBecchi |
Sat, 20 Nov, 22:06 |
Apache Wiki |
[Nutch Wiki] Update of "PluginCentral" by NiccoloBecchi |
Sat, 20 Nov, 22:12 |
Apache Wiki |
[Nutch Wiki] Update of "PublicServers" by dougcook |
Mon, 22 Nov, 23:00 |
Claudio Martella (JIRA) |
[jira] Created: (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) |
Tue, 23 Nov, 15:58 |
Claudio Martella (JIRA) |
[jira] Created: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 13:47 |
Claudio Martella (JIRA) |
[jira] Updated: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 13:49 |
Claudio Martella (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 14:45 |
Claudio Martella (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 14:53 |
Claudio Martella (JIRA) |
[jira] Created: (NUTCH-940) static field plugin |
Fri, 26 Nov, 15:19 |
Claudio Martella (JIRA) |
[jira] Updated: (NUTCH-940) static field plugin |
Fri, 26 Nov, 15:19 |
Claudio Martella (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 18:36 |
David Stuart (JIRA) |
[jira] Commented: (NUTCH-924) Static field in solr mapping |
Tue, 16 Nov, 10:45 |
Enrique Berlanga (JIRA) |
[jira] Created: (NUTCH-938) Imposible to fetch sites with robots.txt |
Tue, 23 Nov, 17:52 |
Enrique Berlanga (JIRA) |
[jira] Updated: (NUTCH-938) Imposible to fetch sites with robots.txt |
Tue, 23 Nov, 18:13 |
Enrique Berlanga (JIRA) |
[jira] Updated: (NUTCH-938) Imposible to fetch sites with robots.txt |
Wed, 24 Nov, 12:24 |
Enrique Berlanga (JIRA) |
[jira] Commented: (NUTCH-938) Imposible to fetch sites with robots.txt |
Thu, 25 Nov, 11:46 |
Enrique Berlanga (JIRA) |
[jira] Closed: (NUTCH-938) Imposible to fetch sites with robots.txt |
Tue, 30 Nov, 09:35 |
Julien Nioche (JIRA) |
[jira] Created: (NUTCH-934) Upgrade to Tika 0.8 |
Mon, 15 Nov, 11:53 |
Ken Krugler |
Charset detection algorithm |
Sat, 06 Nov, 19:03 |
Koray |
how to download image sound and video files? |
Mon, 29 Nov, 10:50 |
Markus Jelsma (JIRA) |
[jira] Commented: (NUTCH-924) Static field in solr mapping |
Tue, 16 Nov, 11:01 |
Markus Jelsma (JIRA) |
[jira] Created: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Fri, 19 Nov, 17:56 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Fri, 19 Nov, 18:44 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Mon, 22 Nov, 13:09 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Mon, 22 Nov, 13:09 |
Markus Jelsma (JIRA) |
[jira] Issue Comment Edited: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Mon, 22 Nov, 13:11 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats |
Mon, 22 Nov, 14:20 |
Markus Jelsma (JIRA) |
[jira] Commented: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats |
Mon, 22 Nov, 14:22 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats |
Mon, 22 Nov, 14:24 |
Markus Jelsma (JIRA) |
[jira] Issue Comment Edited: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats |
Mon, 22 Nov, 14:24 |
Markus Jelsma (JIRA) |
[jira] Commented: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument |
Mon, 22 Nov, 14:33 |
Markus Jelsma (JIRA) |
[jira] Commented: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats |
Mon, 22 Nov, 14:33 |
Markus Jelsma (JIRA) |
[jira] Updated: (NUTCH-935) remove unnecessary /./ in basic urlnormalizer |
Mon, 22 Nov, 14:53 |
Markus Jelsma (JIRA) |
[jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments |
Fri, 26 Nov, 13:53 |
Mattmann, Chris A (388J) |
My ApacheconNA 2010 Slides |
Sat, 06 Nov, 20:24 |
Mattmann, Chris A (388J) |
My ApacheconNA 2010 slides |
Sat, 06 Nov, 20:25 |
Sebastian Nagel (JIRA) |
[jira] Commented: (NUTCH-933) Fetcher does not save a pages Last-Modified value in CrawlDatum |
Wed, 10 Nov, 13:34 |
Senthil |
Re: My ApacheconNA 2010 slides |
Sun, 07 Nov, 09:12 |
Stondubleyt (JIRA) |
[jira] Created: (NUTCH-935) remove unnecessary /./ in basic urlnormalizer |
Wed, 17 Nov, 10:01 |
Stondubleyt (JIRA) |
[jira] Updated: (NUTCH-935) remove unnecessary /./ in basic urlnormalizer |
Wed, 17 Nov, 10:07 |