spark-user mailing list archives

From Sean Owen
Subject Re: Error: Unrecognized option '--conf' (trying to set auto.offset.reset)
Date Sat, 22 Nov 2014 22:17:52 GMT
First, the --conf error: which version of Spark are you running? I don't
think some of these options existed before 1.1, so that may be the issue.
I assume the command is all on one line. Quoting is not an issue here.

The real issue is that auto.offset.reset is indeed a Kafka option.
It's not a system property; if it were, you could set this for the
Spark driver/executor with a different set of options.

You actually have to set this in your code, where you configure the
Spark Streaming Kafka DStream. It's an extra key-value pair you put
into the config Map.
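
For illustration, here is a minimal sketch of what that could look like. It
assumes the receiver-based KafkaUtils.createStream overload from the Spark 1.x
spark-streaming-kafka module (the one that takes a kafkaParams Map) and the
Kafka 0.8 high-level consumer. The ZooKeeper address, group id, topic name and
object name are hypothetical placeholders, not values from your job.

```scala
import kafka.serializer.StringDecoder
import org.apache.spark.SparkConf
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

object LatestOffsetsExample {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("LatestOffsetsExample")
    val ssc = new StreamingContext(conf, Seconds(10))

    // Kafka consumer settings go here, not on the spark-submit command line.
    // Host, group id and topic below are placeholders (assumptions).
    val kafkaParams = Map[String, String](
      "zookeeper.connect" -> "zk-host:2181",
      "group.id" -> "example-group",
      "auto.offset.reset" -> "largest"
    )

    // Topic name -> number of consumer threads for that topic.
    val topics = Map("example-topic" -> 1)

    val stream = KafkaUtils.createStream[String, String, StringDecoder, StringDecoder](
      ssc, kafkaParams, topics, StorageLevel.MEMORY_AND_DISK_SER_2)

    // Each element is a (key, message) pair; print the message values.
    stream.map(_._2).print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Note that Kafka consults auto.offset.reset only when the consumer group has no
committed offset, or when the committed offset is out of range, so a group that
already has offsets stored may still resume from where it left off.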

On Fri, Nov 21, 2014 at 3:58 PM, YaoPau wrote:
> I'm trying to configure my Spark Streaming + Kafka job so it always pulls
> real-time data.  I think setting auto.offset.reset="largest" will do it, but
> when I try to set that configuration at runtime I get an error:
> *spark-submit --class com.autotrader.scalaspark.sbStreaming --master
> yarn-client --driver-memory 10g --executor-memory 10g --conf
> auto.offset.reset="largest"
> sbStreaming-0.0.1-SNAPSHOT-jar-with-dependencies.jar
> Error: Unrecognized option '--conf'.*
> How can I configure my job to pull the latest data?