Md Mahir Asef Kabir (Jira) |
[jira] [Created] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 03:21 |
GitBox |
[GitHub] [nutch] AthenaXiao opened a new pull request #524: [NUTCH-2786] add a warning for insecure TrustManager |
Mon, 04 May, 16:05 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 16:06 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 20:05 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 20:06 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 20:06 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Mon, 04 May, 20:06 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2434) Add methods to reset parameters HTMLMetaTags |
Tue, 05 May, 09:32 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-1652) Avoid instanciation of MimeUtil for each Content object created |
Tue, 05 May, 09:45 |
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-1945) Test for XLSX parser |
Tue, 05 May, 09:53 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-1945) Test for XLSX parser |
Tue, 05 May, 09:53 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2434) Add methods to reset parameters HTMLMetaTags |
Tue, 05 May, 09:55 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #525: NUTCH-1945 Test for XLSX parser |
Tue, 05 May, 11:31 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-1945) Test for XLSX parser |
Tue, 05 May, 11:32 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-1806) Delegate processing of URL domains to crawler commons |
Tue, 05 May, 11:33 |
GitBox |
[GitHub] [nutch] sebastian-nagel commented on pull request #514: NUTCH-1194 Generator: CrawlDB lock should be released earlier |
Tue, 05 May, 11:38 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-1194) Generator: CrawlDB lock should be released earlier |
Tue, 05 May, 11:39 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-1194) Generator: CrawlDB lock should be released earlier |
Tue, 05 May, 12:11 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-1194) Generator: CrawlDB lock should be released earlier |
Tue, 05 May, 12:55 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2785) FreeGenerator: command-line option to define number of generated fetch lists |
Tue, 05 May, 13:58 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2002) ParserChecker and IndexingFiltersChecker to check robots.txt |
Tue, 05 May, 14:00 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2753) Add -listen option to command-line help of CrawlDbReader and LinkDbReader |
Tue, 05 May, 14:01 |
Sebastian Nagel (Jira) |
[jira] [Assigned] (NUTCH-2753) Add -listen option to command-line help of CrawlDbReader and LinkDbReader |
Tue, 05 May, 14:01 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2758) Add plugin READMEs to binary release packages |
Tue, 05 May, 14:02 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2753) Add -listen option to command-line help of CrawlDbReader and LinkDbReader |
Tue, 05 May, 14:55 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2785) FreeGenerator: command-line option to define number of generated fetch lists |
Tue, 05 May, 14:55 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2002) ParserChecker and IndexingFiltersChecker to check robots.txt |
Tue, 05 May, 14:55 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2758) Add plugin READMEs to binary release packages |
Tue, 05 May, 14:55 |
Md Mahir Asef Kabir (Jira) |
[jira] [Updated] (NUTCH-2786) TrustManager methods do not have certificate validation logic |
Fri, 08 May, 18:49 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2419) Domain blacklist URL filter does not respect command-line override for file |
Tue, 12 May, 13:12 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Tue, 12 May, 13:20 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2419) Domain blacklist URL filter does not respect command-line override for file |
Tue, 12 May, 13:20 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Tue, 12 May, 13:21 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-1945) Test for XLSX parser |
Tue, 12 May, 13:36 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-1945) Test for XLSX parser |
Tue, 12 May, 14:02 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2318) Text extraction in HtmlParser adds too much whitespace. |
Tue, 12 May, 17:11 |
Markus Jelsma (Jira) |
[jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Wed, 13 May, 10:27 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Wed, 13 May, 12:42 |
Markus Jelsma (Jira) |
[jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Wed, 13 May, 12:57 |
GitBox |
[GitHub] [nutch] sebastian-nagel merged pull request #526: NUTCH-2419 Some URL filters and normalizers do not respect command-line override for rule file |
Thu, 14 May, 15:43 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Thu, 14 May, 15:44 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Thu, 14 May, 15:44 |
Hudson (Jira) |
[jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file |
Thu, 14 May, 16:00 |
Shashanka Balakuntala Srinivasa (Jira) |
[jira] [Commented] (NUTCH-2596) Upgrade from org.mortbay.jetty to org.eclipse.jetty |
Fri, 15 May, 15:47 |
Sebastian Nagel (Jira) |
[jira] [Commented] (NUTCH-2596) Upgrade from org.mortbay.jetty to org.eclipse.jetty |
Fri, 15 May, 16:37 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #527: NUTCH-2496 Speed up link inversion step in crawling script |
Fri, 15 May, 17:22 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script |
Fri, 15 May, 17:23 |
Sebastian Nagel (Jira) |
[jira] [Updated] (NUTCH-2496) Speed up link inversion step in crawling script |
Fri, 15 May, 17:26 |
Sebastian Nagel (Jira) |
[jira] [Resolved] (NUTCH-1971) The crawldb.url.filters property is not present in any configuration file |
Fri, 15 May, 17:31 |
GitBox |
[GitHub] [nutch] sebastian-nagel opened a new pull request #528: NUTCH-2720 ROBOTS metatag ignored when capitalized |
Fri, 15 May, 21:18 |
ASF GitHub Bot (Jira) |
[jira] [Commented] (NUTCH-2720) ROBOTS metatag ignored when capitalized |
Fri, 15 May, 21:19 |
Sandro Osswald (Jira) |
[jira] [Commented] (NUTCH-2567) parse-metatags writes all meta tags twice |
Mon, 18 May, 12:54 |
Sandro Osswald (Jira) |
[jira] [Comment Edited] (NUTCH-2567) parse-metatags writes all meta tags twice |
Mon, 18 May, 12:54 |