Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1256) WebGraph to dump host + score |
Wed, 01 Feb, 04:22 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment |
Wed, 01 Feb, 04:22 |
Ferdy Galema (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1081) ant tests fail |
Wed, 01 Feb, 09:16 |
Julien Nioche (Created) (JIRA) |
[jira] [Created] (NUTCH-1264) Configurable indexing plugin (index-extra) |
Wed, 01 Feb, 12:19 |
Julien Nioche (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1264) Configurable indexing plugin (index-extra) |
Wed, 01 Feb, 13:51 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 14:18 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 14:32 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 14:40 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 14:44 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 15:01 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 15:03 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 15:05 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 15:08 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Wed, 01 Feb, 15:16 |
Sujit Pal (Created) (JIRA) |
[jira] [Created] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 00:49 |
Sujit Pal (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 00:51 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 00:59 |
Sujit Pal (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 01:19 |
Sujit Pal (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 01:21 |
Lewis John Mcgibbney |
NUTCH-1205 |
Thu, 02 Feb, 12:16 |
Ferdy Galema (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 14:47 |
Lewis John McGibbney (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 16:16 |
Lewis John McGibbney (Closed) (JIRA) |
[jira] [Closed] (NUTCH-1265) [nutchgora] - update to work with gora-0.2-incubating |
Thu, 02 Feb, 16:16 |
nutchsolruser |
Nutch db_unfetched |
Fri, 03 Feb, 08:37 |
nutchsolruser |
Problem with db.max.anchor.length property in nutch-default.xml |
Fri, 03 Feb, 13:19 |
Abhay Dabholkar (Commented) (JIRA) |
[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed |
Fri, 03 Feb, 14:55 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1140) index-more plugin, resetTitle method creates multiple values in the Title field |
Sun, 05 Feb, 10:33 |
Mattmann, Chris A (388J) |
Fwd: [Announce] Google Summer of Code 2012 |
Mon, 06 Feb, 04:24 |
Mattmann, Chris A (388J) |
Fwd: [Announce] Google Summer of Code 2012 |
Mon, 06 Feb, 04:28 |
Markus Jelsma (Created) (JIRA) |
[jira] [Created] (NUTCH-1266) Subcollection to optionally write to configured fields |
Mon, 06 Feb, 13:15 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1266) Subcollection to optionally write to configured fields |
Mon, 06 Feb, 13:35 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1266) Subcollection to optionally write to configured fields |
Mon, 06 Feb, 13:35 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1005) Index headings plugin |
Mon, 06 Feb, 14:17 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1264) Configurable indexing plugin (index-extra) |
Mon, 06 Feb, 14:19 |
Julien Nioche (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1264) Configurable indexing plugin (index-extra) |
Mon, 06 Feb, 16:29 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1264) Configurable indexing plugin (index-extra) |
Mon, 06 Feb, 16:31 |
Julien Nioche (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Mon, 06 Feb, 16:33 |
Julien Nioche (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1264) Configurable indexing plugin (index-metadata) |
Mon, 06 Feb, 16:37 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1264) Configurable indexing plugin (index-metadata) |
Mon, 06 Feb, 16:51 |
Julien Nioche (Created) (JIRA) |
[jira] [Created] (NUTCH-1267) urlmeta to delegate indexing to index-metadata |
Mon, 06 Feb, 16:55 |
Julien Nioche (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1264) Configurable indexing plugin (index-metadata) |
Mon, 06 Feb, 16:57 |
Julien Nioche (Created) (JIRA) |
[jira] [Created] (NUTCH-1268) parse-meta to delegate indexing to index-metadata |
Mon, 06 Feb, 16:57 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1264) Configurable indexing plugin (index-metadata) |
Mon, 06 Feb, 17:25 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1264) Configurable indexing plugin (index-metadata) |
Tue, 07 Feb, 04:30 |
linyuan |
unsubscribe |
Tue, 07 Feb, 06:45 |
swaraj |
unsubscribe |
Tue, 07 Feb, 07:30 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Index headings plugin |
Tue, 07 Feb, 10:42 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter |
Tue, 07 Feb, 10:45 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1266) Subcollection to optionally write to configured fields |
Tue, 07 Feb, 10:45 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1005) Parse headings plugin |
Tue, 07 Feb, 10:48 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1005) Parse headings plugin |
Tue, 07 Feb, 13:26 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1266) Subcollection to optionally write to configured fields |
Tue, 07 Feb, 13:49 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Parse headings plugin |
Tue, 07 Feb, 14:02 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Tue, 07 Feb, 15:24 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Tue, 07 Feb, 15:26 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata |
Tue, 07 Feb, 15:32 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata |
Tue, 07 Feb, 15:32 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Tue, 07 Feb, 15:36 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Tue, 07 Feb, 15:46 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Tue, 07 Feb, 18:36 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1005) Parse headings plugin |
Wed, 08 Feb, 04:30 |
behnam nikbakht (Created) (JIRA) |
[jira] [Created] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 10:24 |
behnam nikbakht (Created) (JIRA) |
[jira] [Created] (NUTCH-1270) some of Deflate encoded pages not fetched |
Wed, 08 Feb, 10:39 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 10:40 |
Lewis John McGibbney (Created) (JIRA) |
[jira] [Created] (NUTCH-1271) Fix errors @ compile time |
Wed, 08 Feb, 10:46 |
behnam nikbakht (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 11:18 |
behnam nikbakht (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 11:18 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 11:31 |
behnam nikbakht (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 11:50 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1269) Generate main problems |
Wed, 08 Feb, 12:03 |
Lewis John McGibbney (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1270) some of Deflate encoded pages not fetched |
Wed, 08 Feb, 12:35 |
Lewis John Mcgibbney |
Fwd: Mandatory svnpubsub migration by Jan 2013 |
Wed, 08 Feb, 12:40 |
Markus Jelsma |
tika-core, tika-parser |
Wed, 08 Feb, 12:50 |
Lewis John Mcgibbney |
Re: tika-core, tika-parser |
Wed, 08 Feb, 12:58 |
Markus Jelsma |
Re: tika-core, tika-parser |
Wed, 08 Feb, 13:00 |
Julien Nioche |
Re: Mandatory svnpubsub migration by Jan 2013 |
Wed, 08 Feb, 13:00 |
Lewis John Mcgibbney |
Re: Mandatory svnpubsub migration by Jan 2013 |
Wed, 08 Feb, 13:03 |
Markus Jelsma |
Re: tika-core, tika-parser |
Wed, 08 Feb, 13:03 |
Julien Nioche |
Re: tika-core, tika-parser |
Wed, 08 Feb, 13:04 |
Julien Nioche |
Re: tika-core, tika-parser |
Wed, 08 Feb, 13:22 |
Markus Jelsma |
Re: tika-core, tika-parser |
Wed, 08 Feb, 13:28 |
Peter Jameson |
Finding specific file types only --> *.ics files |
Wed, 08 Feb, 17:04 |
Ken Krugler |
Re: tika-core, tika-parser |
Wed, 08 Feb, 17:27 |
dibyendu ghosh (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1206) tika parser of nutch 1.3 is failing to process pdfs |
Thu, 09 Feb, 09:38 |
Markus Jelsma |
Re: tika-core, tika-parser |
Thu, 09 Feb, 09:56 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata |
Thu, 09 Feb, 09:57 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1266) Subcollection to optionally write to configured fields |
Thu, 09 Feb, 09:57 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Thu, 09 Feb, 09:57 |
Markus Jelsma (Updated) (JIRA) |
[jira] [Updated] (NUTCH-1262) Map `duplicating` content-types to a single type |
Thu, 09 Feb, 09:57 |
Markus Jelsma |
Re: Finding specific file types only --> *.ics files |
Thu, 09 Feb, 09:59 |
Markus Jelsma (Resolved) (JIRA) |
[jira] [Resolved] (NUTCH-1145) Add linkrank config directives to default conf |
Thu, 09 Feb, 10:00 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1266) Subcollection to optionally write to configured fields |
Thu, 09 Feb, 10:02 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Thu, 09 Feb, 11:48 |
Peter Jameson |
Re: Finding specific file types only --> *.ics files |
Thu, 09 Feb, 14:18 |
Markus Jelsma |
Re: Finding specific file types only --> *.ics files |
Thu, 09 Feb, 14:29 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Thu, 09 Feb, 14:47 |
Markus Jelsma (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin |
Thu, 09 Feb, 15:27 |
Julien Nioche (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata |
Thu, 09 Feb, 16:19 |
Apache Jenkins Server |
Build failed in Jenkins: Nutch-nutchgora #158 |
Fri, 10 Feb, 04:17 |
Hudson (Commented) (JIRA) |
[jira] [Commented] (NUTCH-1266) Subcollection to optionally write to configured fields |
Fri, 10 Feb, 04:31 |