Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1443) Solr schema version is invalid |
Wed, 01 Aug, 11:35 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1443) Solr schema version is invalid |
Wed, 01 Aug, 11:39 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1444) Indexing should not create temporary files (do not extend from FileOutputFormat) |
Wed, 01 Aug, 13:49 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Wed, 01 Aug, 14:14 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1444) Indexing should not create temporary files (do not extend from FileOutputFormat) |
Wed, 01 Aug, 14:21 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Wed, 01 Aug, 14:23 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1444) Indexing should not create temporary files (do not extend from FileOutputFormat) |
Wed, 01 Aug, 14:23 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Wed, 01 Aug, 14:27 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Wed, 01 Aug, 14:37 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Thu, 02 Aug, 10:11 |
lufeng (JIRA) |
[jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries |
Fri, 03 Aug, 07:49 |
Luca Cavanna (JIRA) |
[jira] [Commented] (NUTCH-923) Multilingual support for Solr-index-mapping |
Fri, 03 Aug, 09:51 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-923) Multilingual support for Solr-index-mapping |
Fri, 03 Aug, 10:39 |
Luca Cavanna (JIRA) |
[jira] [Commented] (NUTCH-923) Multilingual support for Solr-index-mapping |
Fri, 03 Aug, 11:39 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Fri, 03 Aug, 15:10 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Fri, 03 Aug, 15:10 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1918 |
Sun, 05 Aug, 04:15 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1919 |
Mon, 06 Aug, 04:04 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Mon, 06 Aug, 10:12 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1445) Add ElasticIndexerJob that indexes to elasticsearch |
Mon, 06 Aug, 10:43 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1047) Pluggable indexing backends |
Mon, 06 Aug, 11:49 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1047) Pluggable indexing backends |
Mon, 06 Aug, 11:51 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1047) Pluggable indexing backends |
Mon, 06 Aug, 11:59 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1047) Pluggable indexing backends |
Mon, 06 Aug, 12:09 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1159) Write JUnit tests for index-anchor |
Mon, 06 Aug, 13:57 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-1159) Write JUnit tests for index-anchor |
Mon, 06 Aug, 13:59 |
Lewis John McGibbney (JIRA) |
[jira] [Assigned] (NUTCH-1160) Write JUnit tests for index-basic |
Mon, 06 Aug, 14:03 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1151) Index-anchor to add numInlinks count |
Mon, 06 Aug, 14:39 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1151) Index-anchor to add numInlinks count |
Mon, 06 Aug, 14:41 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files) |
Mon, 06 Aug, 14:49 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney |
Mon, 06 Aug, 19:42 |
Lewis John Mcgibbney |
Re: Understanding mapping of field characteristics to index structure |
Mon, 06 Aug, 22:01 |
Markus Jelsma |
RE: Understanding mapping of field characteristics to index structure |
Mon, 06 Aug, 22:12 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1920 |
Tue, 07 Aug, 04:05 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1444) Indexing should not create temporary files (do not extend from FileOutputFormat) |
Tue, 07 Aug, 07:10 |
Ferdy Galema |
hadoop.job.history.user.location in nutch-default with CDH rendering job history useless |
Tue, 07 Aug, 09:21 |
Lewis John Mcgibbney |
Re: Understanding mapping of field characteristics to index structure |
Tue, 07 Aug, 11:41 |
Markus Jelsma |
RE: Understanding mapping of field characteristics to index structure |
Tue, 07 Aug, 11:49 |
lewis john mcgibbney |
[ANNOUNCE] Apache Gora 0.2.1 Released |
Tue, 07 Aug, 18:50 |
Trần Anh Tuấn (JIRA) |
[jira] [Created] (NUTCH-1447) Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected |
Wed, 08 Aug, 13:01 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1921 |
Wed, 08 Aug, 13:48 |
Trần Anh Tuấn (JIRA) |
[jira] [Updated] (NUTCH-1447) Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected |
Wed, 08 Aug, 14:11 |
Sebastian Nagel (JIRA) |
[jira] [Updated] (NUTCH-706) Url regex normalizer |
Wed, 08 Aug, 21:51 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-trunk #1922 |
Thu, 09 Aug, 04:36 |
Julien Nioche |
Happy 10th Birthday Nutch! |
Thu, 09 Aug, 07:56 |
Ferdy Galema |
Re: Happy 10th Birthday Nutch! |
Thu, 09 Aug, 08:10 |
Lewis John Mcgibbney |
Re: Happy 10th Birthday Nutch! |
Thu, 09 Aug, 20:31 |
Sebastian Nagel |
duplicate jar files by plugin dependencies |
Thu, 09 Aug, 21:38 |
Mattmann, Chris A (388J) |
Re: Happy 10th Birthday Nutch! |
Thu, 09 Aug, 23:44 |
Lewis John Mcgibbney |
Re: duplicate jar files by plugin dependencies |
Fri, 10 Aug, 09:37 |
Julien Nioche |
Re: duplicate jar files by plugin dependencies |
Fri, 10 Aug, 10:10 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #315 |
Sun, 12 Aug, 16:49 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #316 |
Sun, 12 Aug, 17:02 |
Lewis John Mcgibbney |
Re: Jenkins build is back to normal : Nutch-nutchgora #316 |
Sun, 12 Aug, 17:07 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #317 |
Mon, 13 Aug, 04:05 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1926 |
Mon, 13 Aug, 04:06 |
Apache Wiki |
[Nutch Wiki] Update of "FrontPage" by FerdyGalema |
Mon, 13 Aug, 08:26 |
Apache Wiki |
[Nutch Wiki] Update of "FrontPage" by FerdyGalema |
Mon, 13 Aug, 08:31 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1448) Redirected urls should be handled more cleanly (more like an outlink url) |
Mon, 13 Aug, 08:47 |
Ferdy Galema (JIRA) |
[jira] [Updated] (NUTCH-1448) Redirected urls should be handled more cleanly (more like an outlink url) |
Mon, 13 Aug, 08:51 |
Markus Jelsma (JIRA) |
[jira] [Created] (NUTCH-1449) Optionally delete documents skipped by IndexingFilters |
Mon, 13 Aug, 14:48 |
Ken Krugler (JIRA) |
[jira] [Commented] (NUTCH-1233) Rely on Tika for outlink extraction |
Mon, 13 Aug, 16:46 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1233) Rely on Tika for outlink extraction |
Mon, 13 Aug, 17:00 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-1450) Upgrade to gora deps to 0.2.1 |
Mon, 13 Aug, 17:18 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-1451) Upgrade automaton jar to 1.11-8 |
Mon, 13 Aug, 17:26 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1450) Upgrade to gora deps to 0.2.1 |
Mon, 13 Aug, 17:32 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-1450) Upgrade to gora deps to 0.2.1 |
Mon, 13 Aug, 17:34 |
Lewis John McGibbney (JIRA) |
[jira] [Updated] (NUTCH-1442) indexingfilter.order is property is misread in code |
Mon, 13 Aug, 17:47 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1442) indexingfilter.order is property is misread in code |
Mon, 13 Aug, 17:57 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Mon, 13 Aug, 20:40 |
Lewis John McGibbney (JIRA) |
[jira] [Resolved] (NUTCH-1442) indexingfilter.order is property is misread in code |
Mon, 13 Aug, 20:42 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1442) indexingfilter.order is property is misread in code |
Tue, 14 Aug, 07:24 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1442) indexingfilter.order is property is misread in code |
Tue, 14 Aug, 07:24 |
Ferdy Galema (JIRA) |
[jira] [Closed] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration |
Tue, 14 Aug, 07:32 |
Ferdy Galema |
Re: hadoop.job.history.user.location in nutch-default with CDH rendering job history useless |
Tue, 14 Aug, 08:31 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1452) hadoop.job.history.user.location in nutch-default making job history useless |
Tue, 14 Aug, 08:31 |
Lewis John McGibbney (JIRA) |
[jira] [Created] (NUTCH-1453) Substantiate tests for IndexingFilters |
Tue, 14 Aug, 09:27 |
Lewis John McGibbney (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Tue, 14 Aug, 12:33 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Tue, 14 Aug, 12:56 |
Sebastian Nagel (JIRA) |
[jira] [Created] (NUTCH-1454) parsing chm failed |
Tue, 14 Aug, 20:11 |
Sebastian Nagel (JIRA) |
[jira] [Created] (NUTCH-1455) RobotRulesParser to match multi-word user-agent names |
Tue, 14 Aug, 22:02 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1455) RobotRulesParser to match multi-word user-agent names |
Tue, 14 Aug, 23:13 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #318 |
Wed, 15 Aug, 04:16 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-trunk #1927 |
Wed, 15 Aug, 04:24 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 09:11 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 09:42 |
Markus Jelsma (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 09:46 |
Julien Nioche (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 10:06 |
Markus Jelsma (JIRA) |
[jira] [Updated] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 10:14 |
lin weijian |
DbUpdateReducer could not mark it's batchid |
Wed, 15 Aug, 11:59 |
Ferdy Galema (JIRA) |
[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex |
Wed, 15 Aug, 12:03 |
lin weijian |
A FetchSchedule bug makes fetch time becoming more and more big |
Wed, 15 Aug, 12:11 |
Ferdy Galema |
Re: DbUpdateReducer could not mark it's batchid |
Wed, 15 Aug, 12:12 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1456) Updater not setting batchId in markers correctly. |
Wed, 15 Aug, 12:12 |
Ferdy Galema (JIRA) |
[jira] [Created] (NUTCH-1457) Nutch2 Refactor the update process so that fetched items are only processed once |
Wed, 15 Aug, 12:20 |
Ferdy Galema |
Re: A FetchSchedule bug makes fetch time becoming more and more big |
Wed, 15 Aug, 12:24 |
Ken Krugler (JIRA) |
[jira] [Commented] (NUTCH-1455) RobotRulesParser to match multi-word user-agent names |
Wed, 15 Aug, 13:58 |
Ken Krugler |
Re: bug in parse-tika or Tika RTFParser? |
Thu, 16 Aug, 00:28 |
Apache Jenkins Server |
Jenkins build is back to normal : Nutch-nutchgora #319 |
Thu, 16 Aug, 04:19 |
weishenyun |
Can Nutch process rel-tag likes rel="nofollow"? |
Thu, 16 Aug, 04:27 |