nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zuber (JIRA)" <>
Subject [jira] [Created] (NUTCH-2319) Link with "rel=alternate" doesn't return in crawl
Date Sat, 01 Oct 2016 12:11:20 GMT
Zuber created NUTCH-2319:

             Summary: Link with "rel=alternate" doesn't return in crawl 
                 Key: NUTCH-2319
             Project: Nutch
          Issue Type: Bug
            Reporter: Zuber

I am using nutch-1.4. I am getting the issue that the nutch doesn't return the URLs from the
link rel="alternate".
 For example, I am trying to crawl the URL which
contains the  below link which I am not getting as result.
<link rel="alternate" type="application/atom+xml" href=";x=1"
title="Phoenix - ASU">

Could you please help

This message was sent by Atlassian JIRA

View raw message