nutch-dev mailing list archives

From MengYing Wang <mengyingwa...@gmail.com>
Subject Re: [nsf-polar-usc-students] ExceptionInInitializerError caused by NPE
Date Thu, 20 Nov 2014 21:38:59 GMT
Dear Prof Mattmann,

Yes, I will create a JIRA issue and attach the patch. One more question: do
you happen to know how to modify the parse-tika configuration files so that
rome-0.9.jar is downloaded automatically instead of rome-1.0.jar?

Currently, running the "ant -f ./build-ivy.xml" command in the
parse-tika folder downloads rome-1.0.jar. I have to manually
download rome-0.9.jar into the src/plugin/parse-tika/lib
directory and then edit src/plugin/parse-tika/plugin.xml to reference
rome-0.9.jar instead of rome-1.0.jar, which is inconvenient. Thanks
for your help!
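
In case it helps anyone checking which Rome jar is actually picked up after swapping versions, here is a minimal sketch. The class name JarLocator is hypothetical, and the default class name is taken from the stack trace quoted below. It only inspects the class path of the JVM it runs in, so it assumes the relevant jars are placed on that class path by hand; it does not go through the Nutch plugin class loaders.

```java
import java.security.CodeSource;

/** Reports where a class was loaded from, to spot which jar version wins on the class path. */
final class JarLocator {

    /** Returns the code-source location of the named class, or a short explanation if unavailable. */
    static String locate(String className) {
        try {
            // initialize=false: do not run static initializers while looking the class up
            Class<?> type = Class.forName(className, false, JarLocator.class.getClassLoader());
            CodeSource source = type.getProtectionDomain().getCodeSource();
            if (source == null || source.getLocation() == null) {
                return "(no code source, e.g. a JDK bootstrap class)";
            }
            return source.getLocation().toString();
        } catch (ClassNotFoundException e) {
            return "(not on the class path)";
        }
    }

    public static void main(String[] args) {
        // Default class name comes from the stack trace; pass another name as the first argument.
        String name = args.length > 0 ? args[0] : "com.sun.syndication.feed.synd.SyndFeedImpl";
        System.out.println(name + " -> " + locate(name));
    }
}
```

Run with both candidate jars on the class path, the printed location should show which rome jar the class resolves to first.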

Best,
Mengying (Angela) Wang

On Thu, Nov 20, 2014 at 3:12 AM, Chris Mattmann <chris.mattmann@gmail.com>
wrote:

> Great, can you attach a patch for this?
>
> ------------------------
> Chris Mattmann
> chris.mattmann@gmail.com
>
>
>
>
> -----Original Message-----
> From: MengYing Wang <mengyingwang1@gmail.com>
> Date: Thursday, November 20, 2014 at 7:02 PM
> To: Lewis John Mcgibbney <lewis.mcgibbney@gmail.com>
> Cc: "dev@nutch.apache.org" <dev@nutch.apache.org>, NSF Polar
> CyberInfrastructure DR Students <nsf-polar-usc-students@googlegroups.com>
> Subject: Re: [nsf-polar-usc-students] ExceptionInInitializerError caused
> by NPE
>
> >Dear Lewis,
> >Problem solved by reverting rome-1.0.jar to rome-0.9.jar in
> >parse-tika. This is the same idea as for the feed parser in
> >https://issues.apache.org/jira/browse/NUTCH-1494. Thanks.
> >
> >Best,
> >Mengying (Angela) Wang
> >
> >
> >On Wed, Nov 19, 2014 at 9:08 PM, Lewis John Mcgibbney
> ><lewis.mcgibbney@gmail.com> wrote:
> >
> >Try removing 0.9 from that directory (copy it elsewhere) and attempt to
> >re-parse the directory.
> >Thanks
> >
> >
> >On Wed, Nov 19, 2014 at 8:36 PM, MengYing Wang <mengyingwang1@gmail.com>
> >wrote:
> >
> >Dear Lewis,
> >In feed, it is rome-0.9
> >(http://svn.apache.org/repos/asf/nutch/trunk/src/plugin/feed/ivy.xml),
> >while in parse-tika it is rome-1.0
> >(http://svn.apache.org/repos/asf/nutch/trunk/src/plugin/parse-tika/plugin.xml).
> >I have enabled both feed and parse-tika in nutch-site.xml.
> >Thanks.
> >
> >Best,
> >Mengying (Angela) Wang
> >
> >
> >
> >
> >On Wed, Nov 19, 2014 at 8:42 AM, Lewis John Mcgibbney
> ><lewis.mcgibbney@gmail.com> wrote:
> >
> >Which version of the Rome feed parser is on your class path? It may be
> >pulled in via the Nutch 'feed' plugin or via the Nutch
> >'parse-tika' plugin.
> >Please determine which version(s) are on the class path and which are being
> >used.
> >
> >On Wednesday, November 19, 2014, MengYing Wang <mengyingwang1@gmail.com>
> >wrote:
> >
> >
> >
> >Hi Everyone,
> >In the Nutch parse step, I received the following error. Does anyone know
> >how to solve the problem? I appreciate your help!
> >
> >$ /cygdrive/d/nutch_trunk/runtime/local/bin/nutch parse -D
> >mapred.reduce.tasks=2 -D mapred.child.java.opts=-Xmx1000m -D
> >mapred.reduce.tasks.speculative.execution=false -D
> >mapred.map.tasks.speculative.execution=false -D
> >mapred.compress.map.output=true -D
> >mapred.skip.attempts.to.start.skipping=2 -D
> >mapred.skip.map.max.skip.records=1 crawlId/segments/20141118235323
> >
> >java.lang.ExceptionInInitializerError
> >       at com.sun.syndication.io.SyndFeedInput.build(SyndFeedInput.java:136)
> >       at org.apache.tika.parser.feed.FeedParser.parse(FeedParser.java:70)
> >       at org.apache.nutch.parse.tika.TikaParser.getParse(TikaParser.java:103)
> >       at org.apache.nutch.parse.ParseUtil.parse(ParseUtil.java:95)
> >       at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:101)
> >       at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:44)
> >       at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
> >       at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
> >       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
> >       at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
> >Caused by: java.lang.NullPointerException
> >       at java.util.Properties$LineReader.readLine(Properties.java:434)
> >       at java.util.Properties.load0(Properties.java:353)
> >       at java.util.Properties.load(Properties.java:341)
> >       at com.sun.syndication.io.impl.PropertiesLoader.<init>(PropertiesLoader.java:74)
> >       at com.sun.syndication.io.impl.PropertiesLoader.getPropertiesLoader(PropertiesLoader.java:46)
> >       at com.sun.syndication.io.impl.PluginManager.<init>(PluginManager.java:54)
> >       at com.sun.syndication.io.impl.PluginManager.<init>(PluginManager.java:46)
> >       at com.sun.syndication.feed.synd.impl.Converters.<init>(Converters.java:40)
> >       at com.sun.syndication.feed.synd.SyndFeedImpl.<clinit>(SyndFeedImpl.java:59)
> >       ... 10 more
> >
> >--
> >Best,
> >Mengying (Angela) Wang
> >
> >--
> >You received this message because you are subscribed to the Google Groups
> >"nsf-polar-usc-students" group.
> >To unsubscribe from this group and stop receiving emails from it, send an
> >email to nsf-polar-usc-students+unsubscribe@googlegroups.com.
> >To post to this group, send email to
> >nsf-polar-usc-students@googlegroups.com.
> >Visit this group at http://groups.google.com/group/nsf-polar-usc-students.
> >To view this discussion on the web visit
> >https://groups.google.com/d/msgid/nsf-polar-usc-students/CAJX%3DLAuzcTtYe61Avq1EthNRYN6M-%2BGk%2B7PntdOYvQ4ZkrEJKw%40mail.gmail.com.
> >For more options, visit https://groups.google.com/d/optout.
> >
>
>
>



-- 
Best,
Mengying (Angela) Wang
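
For readers following the stack trace quoted above: the shape of the error (an ExceptionInInitializerError wrapping a NullPointerException thrown from Properties.load) is what one would expect when a static initializer loads a properties resource that cannot be found on the class path. The sketch below reproduces that pattern in isolation. It is not Rome's code; the class names and the resource name are hypothetical, and whether a missing resource is the actual trigger in the rome-1.0 case is an assumption based only on the trace.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.Properties;

final class InitFailureDemo {

    /** Mimics a class whose static initializer loads a properties resource that is missing. */
    static final class NeedsConfig {
        static final Properties CONFIG = new Properties();

        static {
            // getResourceAsStream returns null when the resource is absent (hypothetical name).
            InputStream in = NeedsConfig.class.getResourceAsStream("/no-such-file.properties");
            try {
                CONFIG.load(in); // throws NullPointerException for a null stream
            } catch (IOException e) {
                throw new IllegalStateException(e);
            }
        }

        static int size() {
            return CONFIG.size();
        }
    }

    /** Triggers class initialization and describes how it failed. */
    static String causeOfFailure() {
        try {
            NeedsConfig.size();
            return "none";
        } catch (ExceptionInInitializerError e) {
            // First use of the class: the initializer's exception is available as the cause.
            return e.getCause().getClass().getSimpleName();
        } catch (NoClassDefFoundError e) {
            // Later uses: the JVM refuses to retry initialization of the failed class.
            return "NoClassDefFoundError (class already failed to initialize)";
        }
    }

    public static void main(String[] args) {
        System.out.println("static init failed with: " + causeOfFailure());
    }
}
```

The second catch branch matters in long-running jobs: once a class has failed to initialize, later uses in the same JVM typically fail with NoClassDefFoundError rather than repeating the original cause, so the first occurrence in the log is the one worth reading.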
