Karsten Dello (JIRA) |
[jira] Created: (NUTCH-424) CLONE - Problem persists with Nutch 0.8.1 (Nekohtml 0.9.4) - NekoHTML's DOMFragmentParser hangs on certain URLs |
Mon, 01 Jan, 21:27 |
Karsten Dello (JIRA) |
[jira] Commented: (NUTCH-424) CLONE - Problem persists with Nutch 0.8.1 (Nekohtml 0.9.4) - NekoHTML's DOMFragmentParser hangs on certain URLs |
Mon, 01 Jan, 21:42 |
thomasa...@gmx.net |
database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 00:00 |
Toufeeq Hussain |
Re: [Search-l] database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 01:50 |
Zaheed Haque |
Re: database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 09:44 |
Alan Tanaman |
New index-extra plugin and patch to IndexFilters |
Tue, 02 Jan, 10:24 |
nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 11:03 |
Alan Tanaman |
RE: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 11:52 |
Alan Tanaman |
Creating Lucene Compound Index |
Tue, 02 Jan, 12:57 |
Andrzej Bialecki |
Re: Creating Lucene Compound Index |
Tue, 02 Jan, 13:07 |
Alan Tanaman |
RE: Creating Lucene Compound Index |
Tue, 02 Jan, 13:34 |
Andrzej Bialecki |
Re: Creating Lucene Compound Index |
Tue, 02 Jan, 14:06 |
Alan Tanaman |
RE: Creating Lucene Compound Index |
Tue, 02 Jan, 14:12 |
thomasa...@gmx.net |
Re: database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 18:07 |
Nutch User |
Nutch Programmer Wanted |
Tue, 02 Jan, 21:24 |
Alan Tanaman (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 22:57 |
"Thomas Müller" |
Re: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Wed, 03 Jan, 06:35 |
Alan Tanaman |
RE: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Wed, 03 Jan, 12:09 |
Chee Wu |
nutch81 pages seems were not kept but no error message found |
Wed, 03 Jan, 12:30 |
Meghna Kukreja |
Bug in Nutch, possibly due to issues-273 and 322 |
Wed, 03 Jan, 19:03 |
Andrzej Bialecki |
Re: Bug in Nutch, possibly due to issues-273 and 322 |
Wed, 03 Jan, 19:50 |
Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 04 Jan, 09:30 |
Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 04 Jan, 09:30 |
srinath |
Issues Starting Hadoop Process in Nutch0.9l.1 |
Thu, 04 Jan, 17:00 |
st...@archive.org (JIRA) |
[jira] Created: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 17:21 |
st...@archive.org (JIRA) |
[jira] Updated: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 19:05 |
st...@archive.org (JIRA) |
[jira] Commented: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 19:14 |
st...@archive.org (JIRA) |
[jira] Created: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:12 |
st...@archive.org (JIRA) |
[jira] Commented: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:14 |
st...@archive.org (JIRA) |
[jira] Updated: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:14 |
Armel Nene (JIRA) |
[jira] Created: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implementation. |
Fri, 05 Jan, 14:44 |
Armel Nene (JIRA) |
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implementation. |
Fri, 05 Jan, 15:11 |
Armel T. Nene |
protocol-smb: a new protocol plugin for Windows Shares |
Fri, 05 Jan, 15:22 |
Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implementation. |
Fri, 05 Jan, 15:56 |
Armel Nene (JIRA) |
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implementation. |
Fri, 05 Jan, 16:02 |
Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Fri, 05 Jan, 17:01 |
Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Fri, 05 Jan, 17:01 |
Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-325) UrlFilters.java throws NPE in case urlfilter.order contains Filters that are not in plugin.includes |
Sat, 06 Jan, 09:44 |
Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Sat, 06 Jan, 10:36 |
Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-421) Allow predeterminate running order of index filters |
Sat, 06 Jan, 10:36 |
Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-421) Allow predeterminate running order of index filters |
Sat, 06 Jan, 20:01 |
Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:28 |
Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:28 |
Sami Siren (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:46 |
Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 09 Jan, 08:42 |
Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 09 Jan, 08:42 |
J. Delgado |
Job Opportunity (Sunnyvale, CA) |
Wed, 10 Jan, 03:20 |
Piyush (JIRA) |
[jira] Created: (NUTCH-428) NullPointerException |
Wed, 10 Jan, 14:57 |
DS jha |
sort result on different set of terms |
Wed, 10 Jan, 15:02 |
Piyush (JIRA) |
[jira] Created: (NUTCH-429) Secured Searches |
Thu, 11 Jan, 20:08 |
Piotr Kosiorowski (JIRA) |
[jira] Closed: (NUTCH-429) Secured Searches |
Thu, 11 Jan, 20:48 |
Dennis Kubes |
Re: sort result on different set of terms |
Thu, 11 Jan, 20:54 |
Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 11 Jan, 22:02 |
DS jha |
Re: sort result on different set of terms |
Fri, 12 Jan, 16:40 |
Sami Siren (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Fri, 12 Jan, 20:39 |
Sami Siren (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Fri, 12 Jan, 20:51 |
Dennis Kubes |
Re: sort result on different set of terms |
Fri, 12 Jan, 21:40 |
Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-428) NullPointerException |
Fri, 12 Jan, 22:16 |
Sami Siren (JIRA) |
[jira] Created: (NUTCH-430) integer overflow in HashComparator.compare |
Sat, 13 Jan, 23:07 |
Sami Siren (JIRA) |
[jira] Updated: (NUTCH-430) integer overflow in HashComparator.compare |
Sat, 13 Jan, 23:09 |
Scott Green |
How can I get one plugin's root dir |
Mon, 15 Jan, 02:40 |
Armel Nene (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting unmodified content |
Mon, 15 Jan, 10:12 |
Alan Tanaman (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Mon, 15 Jan, 10:52 |
Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-430) integer overflow in HashComparator.compare |
Mon, 15 Jan, 15:05 |
Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:14 |
Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:33 |
Dennis Kubes |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:44 |
Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:44 |
Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:51 |
Nathan ter Bogt |
Multiple collections |
Tue, 16 Jan, 04:08 |
fantoni benjamin (JIRA) |
[jira] Commented: (NUTCH-39) pagination in search result |
Tue, 16 Jan, 10:33 |
fantoni benjamin (JIRA) |
[jira] Commented: (NUTCH-39) pagination in search result |
Tue, 16 Jan, 10:55 |
Sami Siren |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:19 |
Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:31 |
Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:44 |
Sami Siren |
Next Nutch release |
Tue, 16 Jan, 15:53 |
Andrzej Bialecki |
Re: Next Nutch release |
Tue, 16 Jan, 16:19 |
Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 16:33 |
"Thomas Müller" |
Re: Next Nutch release |
Tue, 16 Jan, 16:37 |
Chris Mattmann |
Re: Next Nutch release |
Tue, 16 Jan, 16:40 |
Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 16:55 |
Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 17:39 |
Alan Tanaman |
RE: Next Nutch release |
Tue, 16 Jan, 17:48 |
Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 19:27 |
Doug Cutting |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 20:16 |
Mike Smith |
Amazon S3/Ec2 problem [injection and fs.rename() problem] |
Tue, 16 Jan, 20:30 |
Scott Green |
Re: How can I get one plugin's root dir |
Wed, 17 Jan, 03:03 |
Scott Green |
How to index in real time? |
Wed, 17 Jan, 03:15 |
Sean Dean |
Issue with trunk (rev 496535) |
Wed, 17 Jan, 07:19 |
Enis Soztutar |
Re: How to index in real time? |
Wed, 17 Jan, 14:15 |
Enis Soztutar |
Re: Next Nutch release |
Wed, 17 Jan, 14:42 |
Krebs, Urs |
SynonymEditor |
Wed, 17 Jan, 14:59 |
Alan Tanaman |
RE: How to index in real time? |
Wed, 17 Jan, 15:09 |
Michael Wechner |
Re: SynonymEditor |
Wed, 17 Jan, 15:11 |
Sami Siren |
Re: Next Nutch release |
Wed, 17 Jan, 15:17 |
Sami Siren |
Re: Next Nutch release |
Wed, 17 Jan, 15:21 |
Enis Soztutar |
Re: Next Nutch release |
Wed, 17 Jan, 15:38 |
Dennis Kubes |
Re: How can I get one plugin's root dir |
Wed, 17 Jan, 17:54 |
Armel T. Nene |
RE: Next Nutch release |
Wed, 17 Jan, 18:08 |
Andrzej Bialecki |
Re: Next Nutch release |
Wed, 17 Jan, 18:24 |