nutch-dev mailing list archives

From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (NUTCH-1944) Add raw content to indexes
Date Fri, 10 Apr 2015 05:21:12 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann resolved NUTCH-1944.
--------------------------------------
    Resolution: Fixed

Committed the pull request (#8) from meabed:

{noformat}
[chipotle:~/tmp/nutch2.x] mattmann% svn commit -m "fix for NUTCH-1944 Index HTML raw content contributed by meabed this closes #8."
Sending        CHANGES.txt
Sending        conf/schema.xml
Adding         src/plugin/index-html
Adding         src/plugin/index-html/build.xml
Adding         src/plugin/index-html/ivy.xml
Adding         src/plugin/index-html/plugin.xml
Adding         src/plugin/index-html/src
Adding         src/plugin/index-html/src/java
Adding         src/plugin/index-html/src/java/org
Adding         src/plugin/index-html/src/java/org/apache
Adding         src/plugin/index-html/src/java/org/apache/nutch
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/HtmlIndexingFilter.java
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/README.md
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/package.html
Transmitting file data ........
Committed revision 1672542.
{noformat}

This has been sitting for a while, and it's a good start. We can build on it using Seb's comments.
Thanks, meabed!

> Add raw content to indexes
> --------------------------
>
>                 Key: NUTCH-1944
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1944
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer, plugin
>            Reporter: Lewis John McGibbney
>            Assignee: Chris A. Mattmann
>             Fix For: 2.4
>
>
> The issue is described very well here:
> https://github.com/Meabed/nutch2-index-html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
