samza-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rick Mangi <r...@chartbeat.com>
Subject Re: Sporadic errors in JobRunner
Date Wed, 18 Nov 2015 20:42:39 GMT
I take that back, it happened again. Will try your patch.


> On Nov 18, 2015, at 3:36 PM, Rick Mangi <rick@chartbeat.com> wrote:
> 
> I seem to have solved it by only specifying a single zookeeper node in my job config.
Maybe a race condition of some sort?
> 
> 
>> On Nov 18, 2015, at 2:37 PM, Yi Pan <nickpan47@gmail.com> wrote:
>> 
>> Hi, Rick,
>> 
>> I think that you are running into SAMZA-754. I have a RB available for it
>> already. I will upload the patch and it would be good if you can try the
>> patch to see whether that solves your problem.
>> 
>> -Yi
>> 
>> On Tue, Nov 17, 2015 at 12:01 PM, Rick Mangi <rick@chartbeat.com> wrote:
>> 
>>> Hi, getting things working on samza 0.10.0 finally :)
>>> 
>>> I’m seeing the following error about 1/4 of the time from run-job.sh when
>>> starting jobs:
>>> 
>>> [yarnmaster01] out: 2015-11-17 14:56:00 KafkaSystemAdmin$ [INFO] Got
>>> metadata: Map(__samza_coordinator_t-key-grouper_dev -> SystemStreamMetadata
>>> [streamName=__samza_coordinator_t-key-grouper_dev,
>>> partitionMetadata={Partition [partition=0]=SystemStreamPartitionMetadata
>>> [oldestOffset=null, newestOffset=null, upcomingOffset=0]}])
>>> [yarnmaster01] out: Exception in thread "main"
>>> java.lang.NullPointerException
>>> [yarnmaster01] out:     at
>>> java.util.concurrent.ConcurrentHashMap.put(ConcurrentHashMap.java:1124)
>>> [yarnmaster01] out:     at
>>> scala.collection.convert.Wrappers$JMapWrapperLike$class.update(Wrappers.scala:257)
>>> [yarnmaster01] out:     at
>>> scala.collection.convert.Wrappers$JConcurrentMapWrapper.update(Wrappers.scala:348)
>>> [yarnmaster01] out:     at
>>> scala.collection.mutable.MapLike$class.getOrElseUpdate(MapLike.scala:189)
>>> [yarnmaster01] out:     at
>>> scala.collection.mutable.AbstractMap.getOrElseUpdate(Map.scala:91)
>>> [yarnmaster01] out:     at
>>> org.apache.samza.system.kafka.KafkaSystemConsumer.register(KafkaSystemConsumer.scala:108)
>>> [yarnmaster01] out:     at
>>> org.apache.samza.coordinator.stream.CoordinatorStreamSystemConsumer.register(CoordinatorStreamSystemConsumer.java:112)
>>> [yarnmaster01] out:     at
>>> org.apache.samza.job.JobRunner.run(JobRunner.scala:88)
>>> [yarnmaster01] out:     at
>>> org.apache.samza.job.JobRunner$.main(JobRunner.scala:43)
>>> [yarnmaster01] out:     at
>>> org.apache.samza.job.JobRunner.main(JobRunner.scala)
>>> [yarnmaster01] out:
>>> 
>>> 
>>> The same job will startup fine a minute later.
>>> 
> 


Mime
View raw message