I have a RedShift cluster of 4 nodes.
- When one of the nodes goes down, will the entire cluster become unavailable?
- If yes - for how long?
- When the cluster gets back - is it returned to exactly the same point it was before the failure, or the data may be rolled back a to S3 snapshot from a few hours ago?
- How can I simulate this situation to check this scenario by myself?
Thanks a lot!
If it's a single node failure - amazon will start a new node and stream data from other nodes (each block is written to two different nodes if any). In such case, we can expect:
In case more than one nodes fails, redshift will restore itself from the latest S3 backup. S3 backups are done on the following occasions: