Node stuck in LEAVING state after nodetool decommission even when data streaming has completed


We have a cluster running Cassandra 4.0.1. There is an issue with the decommission process in this version: the decommission gets stuck after data streaming to the replicas in the cluster has completed. The node status still shows as leaving even after 12 hours, and many compaction requests are getting queued up on the node. We observed this after upgrading from 3.11 to 4.0.1. Did anyone face this kind of issue? If yes, what's the resolution? I tried multiple approaches but the result remains the same. Can anyone please check and answer?

Regards, Mani

I tried this https://support.datastax.com/s/article/Node-stuck-in-LEAVING-state-after-being-decommissioned but it didn't work.


There is 1 answer

Erick Ramirez On

If the streams in the nodetool netstats output show as 100% then it means that the node is waiting for the receiving replicas to acknowledge completion.
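As a rough way to check this without reading the output by eye, here is a minimal Python sketch. It assumes the `nodetool netstats` output starts with a `Mode:` line and reports stream progress as percentages in parentheses, such as `(100.00%)`; the exact layout may differ between versions, so treat the patterns as assumptions. The helper names (`run_nodetool`, `node_mode`, `stream_percentages`, `all_streams_sent`) are made up for this illustration.

```python
import re
import subprocess

# Assumption: progress is printed as "(NN.NN%)" on the stream lines.
PERCENT_RE = re.compile(r"\((\d+(?:\.\d+)?)%\)")
# Assumption: the output contains a line such as "Mode: LEAVING".
MODE_RE = re.compile(r"^Mode:\s*(\S+)", re.MULTILINE)


def run_nodetool(command):
    """Run a nodetool subcommand on the local node and return its stdout."""
    result = subprocess.run(
        ["nodetool", command], capture_output=True, text=True, check=True
    )
    return result.stdout


def node_mode(netstats_text):
    """Return the reported mode (e.g. 'LEAVING'), or None if no Mode line is found."""
    match = MODE_RE.search(netstats_text)
    return match.group(1) if match else None


def stream_percentages(netstats_text):
    """Collect every percentage that appears in parentheses in the output."""
    return [float(p) for p in PERCENT_RE.findall(netstats_text)]


def all_streams_sent(netstats_text):
    """True only if at least one percentage was found and all are at 100."""
    percentages = stream_percentages(netstats_text)
    return bool(percentages) and all(p >= 100.0 for p in percentages)
```

On the node being decommissioned, `all_streams_sent(run_nodetool("netstats"))` returning `True` while `node_mode(...)` is still `LEAVING` would match the situation described above: everything has been sent, and the node is waiting on the receivers.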

A common cause of the delay is when there are several indexes defined in the schema. If this is the case then the receiving replicas are still busy indexing the data that got streamed to them so they won't send an acknowledgement to the source node (node being decommissioned) and the stream is not considered complete yet.

You can check the progress of the indexing on the receiving replicas by running nodetool compactionstats. All receiving replicas must acknowledge completion on their end before all the streams on the source node are considered complete. Cheers!
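If you capture the `nodetool compactionstats` output from each receiving replica, a small Python sketch like the one below can flag the ones that still look busy. It assumes the output begins with a `pending tasks: N` line and that index builds are listed on rows containing the words "index build" (for example "Secondary index build"); that label is an assumption and may vary by version. The function names are hypothetical.

```python
import re

# Assumption: the first line of the output looks like "pending tasks: 3".
PENDING_RE = re.compile(r"^pending tasks:\s*(\d+)", re.MULTILINE)


def pending_tasks(compactionstats_text):
    """Return the pending task count, or None if the line is not present."""
    match = PENDING_RE.search(compactionstats_text)
    return int(match.group(1)) if match else None


def index_build_lines(compactionstats_text):
    """Return the rows that mention an index build (assumed label, case-insensitive)."""
    return [
        line.strip()
        for line in compactionstats_text.splitlines()
        if "index build" in line.lower()
    ]


def replica_still_indexing(compactionstats_text):
    """True if at least one row in the output looks like an index build."""
    return bool(index_build_lines(compactionstats_text))
```

A replica where `replica_still_indexing(...)` is true, or where `pending_tasks(...)` keeps a high value, is a likely candidate for the one holding back the acknowledgement.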

