nutch-dev mailing list archives

From "Hudson (Jira)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer
Date Mon, 09 Nov 2020 01:10:00 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228301#comment-17228301 ]

Hudson commented on NUTCH-2823:
-------------------------------

FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #11 (See [https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/11/])
NUTCH-2823 IllegalStateException in IndexWriters.describe() when validating url param for
SolrIndexer (snagel: [https://github.com/apache/nutch/commit/96bd7577b7276c91f01e6b226742805b481151b4])
* (edit) src/java/org/apache/nutch/indexer/IndexWriters.java


> IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer
> ------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-2823
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2823
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer, plugin
>    Affects Versions: 1.16, 1.17
>            Reporter: Joe Gilvary
>            Assignee: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.18
>
>
> The string validation in IndexWriters.describe() fails when a param value in index-writers.xml is too long.
> I encountered the exception when using three comma-separated URL values in a config that worked for Nutch 1.15. The schema doesn't allow multiple values, but the documentation says a comma-separated list works.
> Indexing ran without the exception when I changed the config to use only one host's URL (Solr Cloud). Sebastian reproduced the error with a long string value for the param, so it's not directly due to the comma-separated values.
> While googling I found this thread in the archives where Markus encountered it going from 1.15 to 1.16:
> mail-archives.apache.org/mod_mbox/nutch-user/201910.mbox/<05eda22b-14b2-309f-3bc7-d6d85c218235@googlemail.com>
> I also found a change in 1.16 that might be relevant: NUTCH-2602
>  https://issues.apache.org/jira/browse/NUTCH-2602
> My stack trace:
> java.lang.Exception: java.lang.IllegalStateException: text width is less than 1, was <-26>
>     at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:492)
>     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:559)
> Caused by: java.lang.IllegalStateException: text width is less than 1, was <-26>
>     at org.apache.commons.lang3.Validate.validState(Validate.java:829)
>     at de.vandermeer.skb.interfaces.transformers.textformat.Text_To_FormattedText.transform(Text_To_FormattedText.java:215)
>     at de.vandermeer.asciitable.AT_Renderer.renderAsCollection(AT_Renderer.java:250)
>     at de.vandermeer.asciitable.AT_Renderer.render(AT_Renderer.java:128)
>     at de.vandermeer.asciitable.AsciiTable.render(AsciiTable.java:191)
>     at org.apache.nutch.indexer.IndexWriters.describe(IndexWriters.java:326)
>     at org.apache.nutch.indexer.IndexerOutputFormat.getRecordWriter(IndexerOutputFormat.java:45)
>     at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.<init>(ReduceTask.java:542)
>     at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:615)
>     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:390)
>     at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:347)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
>  
>  Thanks,
>  Joe
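
For readers unfamiliar with this kind of failure, the exception message ("text width is less than 1") can be illustrated with plain arithmetic. The sketch below is only one plausible way a fixed-width table layout could end up with a negative text width when a cell holds a long value without spaces. It is not the asciitable code and not the change committed for NUTCH-2823. The class name, the method names, the widths, and the example.org URLs are all assumptions made for illustration.

```java
import java.util.Arrays;

// Hypothetical sketch: NOT the asciitable implementation and NOT the Nutch fix.
final class ColumnWidthSketch {

    /** Width left for a cell's text once other columns and borders/padding are subtracted. */
    static int remainingTextWidth(int tableWidth, int otherColumnsWidth, int overhead) {
        return tableWidth - otherColumnsWidth - overhead;
    }

    /** Length of the longest run of non-whitespace characters in a value. */
    static int longestToken(String value) {
        return Arrays.stream(value.trim().split("\\s+"))
                .mapToInt(String::length)
                .max()
                .orElse(0);
    }

    /** Defensive variant: clamp the computed width so it is never below 1. */
    static int safeTextWidth(int tableWidth, int otherColumnsWidth, int overhead) {
        return Math.max(1, remainingTextWidth(tableWidth, otherColumnsWidth, overhead));
    }

    public static void main(String[] args) {
        // A comma-separated URL list contains no spaces, so it is one long token.
        String urls = "http://solr1.example.org:8983/solr/nutch,"
                + "http://solr2.example.org:8983/solr/nutch,"
                + "http://solr3.example.org:8983/solr/nutch";
        System.out.println("longest token: " + longestToken(urls));

        // Assumed numbers: an 80-character table, 100 characters claimed by
        // another column, 6 characters of borders and padding.
        System.out.println("unclamped: " + remainingTextWidth(80, 100, 6));
        System.out.println("clamped:   " + safeTextWidth(80, 100, 6));
    }
}
```

With the assumed numbers, the unclamped width is negative, which is the kind of state a validity check would reject. The clamped variant only shows one defensive option; whether clamping, widening the table, or wrapping the value is appropriate depends on the renderer.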



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
