nutch-dev mailing list archives

From "Julien Nioche (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4
Date Fri, 02 May 2014 08:29:16 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13987496#comment-13987496 ]

Julien Nioche commented on NUTCH-1714:
--------------------------------------

Hi [~alparslan.avci] 

It does not fix the issue. The main difference between the last two patches is:

{quote}
> -    query.setFields(WebPage._ALL_FIELDS);
> +
> +    //remove the __g__dirty field since it is not stored
> +    String[] fields = Arrays.copyOfRange(WebPage._ALL_FIELDS, 1,
> +        WebPage._ALL_FIELDS.length);
> +    query.setFields(fields);
{quote}

I can't quite see how this would fix it. BTW, can you be sure that __g__dirty will always be at
the first position in the array? Wouldn't it be cleaner to put the fields in a List and remove
__g__dirty by its name?
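
A rough sketch of the remove-by-name idea, not a tested patch. ALL_FIELDS here is a hypothetical stand-in for WebPage._ALL_FIELDS (its real contents and order are not shown in this thread), and withoutField is a made-up helper name. It only illustrates dropping the field without relying on its position; whether that addresses the underlying issue is a separate question.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class FieldFilter {

    // Hypothetical stand-in for WebPage._ALL_FIELDS; the field names
    // other than "__g__dirty" are placeholders for illustration only.
    static final String[] ALL_FIELDS = {"__g__dirty", "baseUrl", "status", "fetchTime"};

    // Returns a copy of allFields without the named field.
    // If the name is absent, the copy is returned unchanged.
    static String[] withoutField(String[] allFields, String name) {
        // Arrays.asList is fixed-size, so copy into a mutable list first.
        List<String> fields = new ArrayList<>(Arrays.asList(allFields));
        fields.remove(name); // removes the first matching element, wherever it sits
        return fields.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String[] fields = withoutField(ALL_FIELDS, "__g__dirty");
        System.out.println(Arrays.toString(fields));
    }
}
```

In the patch, the result would presumably be passed to query.setFields(...) in place of the Arrays.copyOfRange(...) slice, so the code keeps working if the position of __g__dirty in the array ever changes.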



> Nutch 2.x upgrade to Gora 0.4
> -----------------------------
>
>                 Key: NUTCH-1714
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1714
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Alparslan Avcı
>            Assignee: Alparslan Avcı
>             Fix For: 2.3
>
>         Attachments: NUTCH-1714.patch, NUTCH-1714_NUTCH-1714_v2_v3.patch, NUTCH-1714v2.patch, NUTCH-1714v4.patch, NUTCH-1714v5.patch
>
>
> Nutch upgrade for GORA_94 branch has to be implemented. We can discuss the details in this issue.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
