samza-dev mailing list archives

From Rick Mangi <r...@chartbeat.com>
Subject Re: Sporadic errors in JobRunner
Date Wed, 18 Nov 2015 21:20:27 GMT
That patch seems to have fixed the problem.


> On Nov 18, 2015, at 3:43 PM, Rick Mangi <rick@chartbeat.com> wrote:
> 
> Sorry. Just read the bug. Yes, that makes sense. I deleted a bunch of topics and then hit this.
> 
> 
>> On Nov 18, 2015, at 3:42 PM, Rick Mangi <rick@chartbeat.com> wrote:
>> 
>> I take that back, it happened again. Will try your patch.
>> 
>> 
>>> On Nov 18, 2015, at 3:36 PM, Rick Mangi <rick@chartbeat.com> wrote:
>>> 
>>> I seem to have solved it by only specifying a single zookeeper node in my job config. Maybe a race condition of some sort?
>>> 
>>> 
>>>> On Nov 18, 2015, at 2:37 PM, Yi Pan <nickpan47@gmail.com> wrote:
>>>> 
>>>> Hi, Rick,
>>>> 
>>>> I think that you are running into SAMZA-754. I have an RB available for it
>>>> already. I will upload the patch, and it would be good if you could try the
>>>> patch to see whether it solves your problem.
>>>> 
>>>> -Yi
>>>> 
>>>> On Tue, Nov 17, 2015 at 12:01 PM, Rick Mangi <rick@chartbeat.com> wrote:
>>>> 
>>>>> Hi, getting things working on samza 0.10.0 finally :)
>>>>> 
>>>>> I’m seeing the following error about 1/4 of the time from run-job.sh when
>>>>> starting jobs:
>>>>> 
>>>>> [yarnmaster01] out: 2015-11-17 14:56:00 KafkaSystemAdmin$ [INFO] Got
>>>>> metadata: Map(__samza_coordinator_t-key-grouper_dev -> SystemStreamMetadata
>>>>> [streamName=__samza_coordinator_t-key-grouper_dev,
>>>>> partitionMetadata={Partition [partition=0]=SystemStreamPartitionMetadata
>>>>> [oldestOffset=null, newestOffset=null, upcomingOffset=0]}])
>>>>> [yarnmaster01] out: Exception in thread "main"
>>>>> java.lang.NullPointerException
>>>>> [yarnmaster01] out:     at
>>>>> java.util.concurrent.ConcurrentHashMap.put(ConcurrentHashMap.java:1124)
>>>>> [yarnmaster01] out:     at
>>>>> scala.collection.convert.Wrappers$JMapWrapperLike$class.update(Wrappers.scala:257)
>>>>> [yarnmaster01] out:     at
>>>>> scala.collection.convert.Wrappers$JConcurrentMapWrapper.update(Wrappers.scala:348)
>>>>> [yarnmaster01] out:     at
>>>>> scala.collection.mutable.MapLike$class.getOrElseUpdate(MapLike.scala:189)
>>>>> [yarnmaster01] out:     at
>>>>> scala.collection.mutable.AbstractMap.getOrElseUpdate(Map.scala:91)
>>>>> [yarnmaster01] out:     at
>>>>> org.apache.samza.system.kafka.KafkaSystemConsumer.register(KafkaSystemConsumer.scala:108)
>>>>> [yarnmaster01] out:     at
>>>>> org.apache.samza.coordinator.stream.CoordinatorStreamSystemConsumer.register(CoordinatorStreamSystemConsumer.java:112)
>>>>> [yarnmaster01] out:     at
>>>>> org.apache.samza.job.JobRunner.run(JobRunner.scala:88)
>>>>> [yarnmaster01] out:     at
>>>>> org.apache.samza.job.JobRunner$.main(JobRunner.scala:43)
>>>>> [yarnmaster01] out:     at
>>>>> org.apache.samza.job.JobRunner.main(JobRunner.scala)
>>>>> [yarnmaster01] out:
>>>>> 
>>>>> 
>>>>> The same job will start up fine a minute later.
>>>>> 
>>> 
>> 
> 
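
Note on the trace in the original message: the exception is thrown from ConcurrentHashMap.put, reached through getOrElseUpdate in KafkaSystemConsumer.register, and the metadata logged just before it shows oldestOffset=null and newestOffset=null for the coordinator stream. java.util.concurrent.ConcurrentHashMap rejects null keys and null values, so one plausible reading (an assumption; the thread does not confirm which argument was null) is that a null offset was being stored for a freshly created, empty topic. The sketch below only demonstrates that JDK behavior; the class name NullOffsetDemo, the helper putThrowsNpe and the key "partition-0" are hypothetical and are not taken from Samza.

```java
import java.util.concurrent.ConcurrentHashMap;

public class NullOffsetDemo {
    // Hypothetical helper (not Samza code): reports whether storing the
    // given key/value pair in a ConcurrentHashMap throws NullPointerException.
    static boolean putThrowsNpe(String key, String value) {
        ConcurrentHashMap<String, String> offsets = new ConcurrentHashMap<>();
        try {
            offsets.put(key, value);
            return false;
        } catch (NullPointerException e) {
            // ConcurrentHashMap permits neither null keys nor null values.
            return true;
        }
    }

    public static void main(String[] args) {
        // A real offset string is stored without complaint.
        System.out.println(putThrowsNpe("partition-0", "0"));   // prints "false"
        // A null offset, as in the logged metadata, is rejected.
        System.out.println(putThrowsNpe("partition-0", null));  // prints "true"
    }
}
```

This would also fit the report that the failures followed deleting a number of topics, but whether it is the exact condition addressed by SAMZA-754 is not stated in this thread.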

