Julien Nioche (JIRA) |
[jira] Created: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
Mon, 01 Mar, 15:03 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
Mon, 01 Mar, 15:05 |
Julien Nioche (JIRA) |
[jira] Closed: (NUTCH-782) Ability to order htmlparsefilters |
Mon, 01 Mar, 15:09 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-782) Ability to order htmlparsefilters |
Tue, 02 Mar, 04:11 |
|
[Nutch Wiki] Update of "Becoming_A_Nutch_Developer" by maqboolzee |
|
Apache Wiki |
[Nutch Wiki] Update of "Becoming_A_Nutch_Developer" by maqboolzee |
Wed, 03 Mar, 02:39 |
Apache Wiki |
[Nutch Wiki] Update of "Becoming_A_Nutch_Developer" by maqboolzee |
Wed, 03 Mar, 02:45 |
Lukáš Vlček |
Ning's HTTP Client Library |
Fri, 05 Mar, 06:02 |
kadiyalasubhash |
problem while crawling with nucht 1.0 |
Fri, 05 Mar, 07:13 |
|
[jira] Commented: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
|
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
Fri, 05 Mar, 09:56 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
Sat, 06 Mar, 04:09 |
Julien Nioche (JIRA) |
[jira] Closed: (NUTCH-799) SOLRIndexer to commit once all reducers have finished |
Fri, 05 Mar, 10:10 |
Jesse Campbell (JIRA) |
[jira] Created: (NUTCH-800) Generator builds a URL list that is not encoded |
Fri, 05 Mar, 22:01 |
h...@adonimi.nl |
unsubscribe |
Sat, 06 Mar, 00:29 |
|
[jira] Updated: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
|
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Sat, 06 Mar, 13:14 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Sat, 06 Mar, 13:14 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 09:04 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 10:22 |
hussam hamdan |
hi |
Sun, 07 Mar, 13:38 |
Sahil Shah |
adding an Index attribute |
Mon, 08 Mar, 06:35 |
MilleBii |
Re: adding an Index attribute |
Mon, 08 Mar, 07:43 |
Sahil Shah |
Re: adding an Index attribute |
Mon, 08 Mar, 07:59 |
Mattmann, Chris A (388J) |
1.1 release? |
Tue, 09 Mar, 17:09 |
Julien Nioche |
Re: 1.1 release? |
Tue, 09 Mar, 17:17 |
Andrzej Bialecki |
Re: 1.1 release? |
Tue, 09 Mar, 18:54 |
Mattmann, Chris A (388J) |
Re: 1.1 release? |
Wed, 31 Mar, 16:17 |
|
[jira] Commented: (NUTCH-798) Upgrade to SOLR1.4 |
|
Sami Siren (JIRA) |
[jira] Commented: (NUTCH-798) Upgrade to SOLR1.4 |
Wed, 10 Mar, 12:23 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-798) Upgrade to SOLR1.4 |
Wed, 10 Mar, 13:17 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-798) Upgrade to SOLR1.4 |
Fri, 12 Mar, 04:11 |
Julien Nioche (JIRA) |
[jira] Created: (NUTCH-801) Remove RTF and MP3 parse plugins |
Wed, 10 Mar, 14:47 |
|
[jira] Commented: (NUTCH-801) Remove RTF and MP3 parse plugins |
|
Chris A. Mattmann (JIRA) |
[jira] Commented: (NUTCH-801) Remove RTF and MP3 parse plugins |
Wed, 10 Mar, 14:51 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-801) Remove RTF and MP3 parse plugins |
Wed, 10 Mar, 15:17 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-801) Remove RTF and MP3 parse plugins |
Fri, 12 Mar, 04:11 |
Santiago Pérez |
Increasing the score of especific pages |
Thu, 11 Mar, 10:24 |
Julien Nioche (JIRA) |
[jira] Resolved: (NUTCH-798) Upgrade to SOLR1.4 |
Thu, 11 Mar, 13:07 |
Julien Nioche (JIRA) |
[jira] Resolved: (NUTCH-801) Remove RTF and MP3 parse plugins |
Thu, 11 Mar, 13:26 |
nikinch |
Creating new linked entries in crawlDB |
Thu, 11 Mar, 14:46 |
Jesiel Trevisan |
Re: Creating new linked entries in crawlDB |
Thu, 11 Mar, 14:52 |
Piet Schrijver (JIRA) |
[jira] Commented: (NUTCH-650) Hbase Integration |
Fri, 12 Mar, 13:16 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-710) Support for rel="canonical" attribute |
Mon, 15 Mar, 14:36 |
Julien Nioche (JIRA) |
[jira] Resolved: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 |
Mon, 15 Mar, 15:54 |
|
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam |
|
Apache Wiki |
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam |
Mon, 15 Mar, 21:37 |
Apache Wiki |
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam |
Mon, 15 Mar, 22:01 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "HttpAuthenticationSchemes" by susam |
Mon, 15 Mar, 22:04 |
|
[Nutch Wiki] Trivial Update of "Crawl" by susam |
|
Apache Wiki |
[Nutch Wiki] Trivial Update of "Crawl" by susam |
Mon, 15 Mar, 22:08 |
Apache Wiki |
[Nutch Wiki] Trivial Update of "Crawl" by susam |
Mon, 15 Mar, 22:10 |
garpinc (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Mon, 15 Mar, 22:38 |
garpinc (JIRA) |
[jira] Updated: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Mon, 15 Mar, 22:40 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 |
Tue, 16 Mar, 12:01 |
|
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages. |
|
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages. |
Tue, 16 Mar, 12:43 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages. |
Tue, 23 Mar, 04:13 |
|
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages. |
|
Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages. |
Tue, 16 Mar, 19:46 |
Julien Nioche (JIRA) |
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages. |
Fri, 19 Mar, 17:30 |
|
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
|
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Tue, 16 Mar, 21:30 |
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Tue, 16 Mar, 21:44 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Tue, 16 Mar, 22:20 |
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Thu, 18 Mar, 14:01 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Thu, 18 Mar, 14:24 |
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Thu, 18 Mar, 14:32 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 11:45 |
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 12:03 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 12:49 |
Julien Nioche (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 14:19 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Mon, 22 Mar, 15:45 |
Hudson (JIRA) |
[jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB |
Tue, 23 Mar, 04:13 |
|
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
|
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 12:41 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 13:52 |
Ken Krugler (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 14:06 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 14:54 |
Ken Krugler (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 16:38 |
Robert Hohman (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 17:21 |
Jukka Zitting (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 18:43 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 18:55 |
Jukka Zitting (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Thu, 18 Mar, 12:03 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Thu, 18 Mar, 14:18 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Fri, 19 Mar, 10:11 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" |
Wed, 17 Mar, 13:58 |
Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-796) Zero results problems difficult to troubleshoot due to lack of logging |
Wed, 17 Mar, 14:28 |
|
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0. |
|
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0. |
Wed, 17 Mar, 14:30 |
Dawid Weiss (JIRA) |
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0. |
Wed, 17 Mar, 14:44 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0. |
Fri, 19 Mar, 11:23 |
Andrzej Bialecki (JIRA) |
[jira] Assigned: (NUTCH-774) Retry interval in crawl date is set to 0 |
Wed, 17 Mar, 15:02 |
Pablo Aragón (JIRA) |
[jira] Created: (NUTCH-802) Problems managing outlinks with large url length |
Thu, 18 Mar, 10:40 |