nutch-dev mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1988) Make nested output directory dump optional
Date Wed, 15 Apr 2015 19:23:59 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14496751#comment-14496751 ]

ASF GitHub Bot commented on NUTCH-1988:
---------------------------------------

GitHub user MJJoyce opened a pull request:

    https://github.com/apache/nutch/pull/19

    NUTCH-1988 - Add optional flat directory flag to dump command

    - Add optional flatdir flag to dump command so that a user can dump
      their crawl data to a flat directory instead of the nested structure
      added in NUTCH-1957.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/MJJoyce/nutch NUTCH-1988

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/19.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19
    
----
commit 40ca3e576781328b9b5afc22548a93bfd3df75bd
Author: Michael Joyce <mltjoyce@gmail.com>
Date:   2015-04-15T19:19:07Z

    NUTCH-1988 - Add optional flat directory flag to dump command
    
    - Add optional flatdir flag to dump command so that a user can dump
      their crawl data to a flat directory instead of the nested structure
      added in NUTCH-1957.

----


> Make nested output directory dump optional
> ------------------------------------------
>
>                 Key: NUTCH-1988
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1988
>             Project: Nutch
>          Issue Type: Improvement
>          Components: dumpers
>    Affects Versions: 1.9
>            Reporter: Michael Joyce
>            Priority: Minor
>             Fix For: 1.10
>
>
> NUTCH-1957 added nested directories to the bin/nutch dump output to help avoid
> naming conflicts in output files. It would be nice to be able to specify that
> you want the older flat directory output as an optional parameter.
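
As a rough illustration of the naming conflict described above, the sketch below contrasts a flat layout with a nested one. It is not the Nutch implementation: the function name `output_path`, its parameters, and the two-level layout derived from an MD5 digest of the URL are all assumptions made for this example, and the URLs are placeholders.

```python
import hashlib
from pathlib import PurePosixPath


def output_path(out_dir: str, url: str, filename: str, flat: bool = False) -> PurePosixPath:
    """Return where a dumped file would land (hypothetical layout, for illustration only)."""
    root = PurePosixPath(out_dir)
    if flat:
        # Flat layout: every file sits directly in out_dir, so two URLs
        # that share a base name map to the same path.
        return root / filename
    # Nested layout (assumed here): two directory levels taken from the
    # URL's MD5 hex digest make such clashes much less likely.
    digest = hashlib.md5(url.encode("utf-8")).hexdigest()
    return root / digest[:2] / digest[2:4] / filename


if __name__ == "__main__":
    first = "http://example.com/a/index.html"
    second = "http://example.org/b/index.html"
    # Flat: both URLs resolve to the same output file.
    print(output_path("dump", first, "index.html", flat=True))
    print(output_path("dump", second, "index.html", flat=True))
    # Nested: each URL gets its own digest-based subdirectory.
    print(output_path("dump", first, "index.html"))
    print(output_path("dump", second, "index.html"))
```

The flat variant is simpler to browse and post-process, which is presumably why the issue asks for it as an opt-in; the nested variant trades that convenience for fewer overwritten files.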



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
