kafka-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karol Nowak <gryw...@gmail.com>
Subject Failed partition reassignment
Date Mon, 01 Dec 2014 14:31:06 GMT

I observed some error messages / exceptions while running partition
reassignment on kafka cluster. Being fairly new to this system I'm
not sure if these indicate serious failures or transient problems, or if
manual intervention is needed.

I used kafka-reassign-partitions.sh to reassign partitions from brokers
{143,155,155,93} to {143,155,115,68} on a healthy (?) cluster. Right now
one partition has just two replicas in the ISR and a number of partitions
is left with 4 partitions in ISR even though replication factor is 3. Logs
show a few zookeeper timeouts, but there were no GC pauses anywhere near
the session timeout. Zookeeper itself seems healthy and not overloaded,
with exception of regular CPU spikes, probably related to snapshots.

I cleaned the log lines a little bit for brevity.

First example: https://gist.github.com/knowak/a682afc1545fdeb836a1
Second one with two similar stack traces:
Third one, many many of these:
Fourth: https://gist.github.com/knowak/1fbde5ca90d8f1924141



  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message