Thursday, August 27, 2009

Redundant management nodes in MySQL Cluster

Every time I teach the MySQL Cluster architecture, someone inevitably asks: "Isn't the management node (ndb_mgmd) a single point of failure?" The short answer: no. The management node is not a SPOF, because the cluster can continue running without it. However, it is inconvenient to have your management node down, because it handles several things:

  • Provides status information about the cluster and lets you use the ndb_mgm client for maintenance tasks such as taking a hot backup (see the example after this list)
  • Owns the cluster config file (so it must be running for a node to start)
  • Acts as arbitrator in case of a potential split-brain
  • Handles cluster logging
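
For example, checking node status and taking a hot backup are both one-liners with the ndb_mgm client (standard commands; the management server must be reachable):

  # print the status of all cluster nodes
  ndb_mgm -e "SHOW"

  # start an online (hot) backup of the data nodes
  ndb_mgm -e "START BACKUP"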


So while the management node can be down, it is nice to have a redundant one for failover. This is very easy to do:


  1. Add 2 [NDB_MGMD] sections to config.ini:
    [NDB_MGMD]
    #Id is required when defining multiple mgmt nodes
    Id=1
    Hostname=192.168.0.31

    [NDB_MGMD]
    Id=2
    Hostname=192.168.0.32

  2. Change the ndb-connectstring to include the IPs of both management nodes:
    [mysql_cluster]
    ndb-connectstring=192.168.0.31,192.168.0.32

  3. Make sure config.ini is present on both management nodes and that the two copies are identical, then start both ndb_mgmd nodes. (A full sketch of the resulting config and start commands follows this list.)
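
Putting the pieces together, a minimal complete config.ini could look like the sketch below. Only the two [NDB_MGMD] sections come from this setup; the data node and SQL node sections, the .33/.34 IPs, and the DataDir path are illustrative assumptions:

  [NDBD DEFAULT]
  NoOfReplicas=2

  [NDB_MGMD]
  Id=1
  Hostname=192.168.0.31

  [NDB_MGMD]
  Id=2
  Hostname=192.168.0.32

  [NDBD]
  Hostname=192.168.0.33
  DataDir=/var/lib/mysql-cluster

  [NDBD]
  Hostname=192.168.0.34
  DataDir=/var/lib/mysql-cluster

  [MYSQLD]
  [MYSQLD]

Start each management node against its local copy of the file, and point the data nodes at both management hosts:

  # on 192.168.0.31 and 192.168.0.32 (path is an assumption)
  ndb_mgmd -f /var/lib/mysql-cluster/config.ini

  # on each data node host
  ndbd --ndb-connectstring="192.168.0.31,192.168.0.32"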


That's it! The management nodes will act in an active-passive way and fail over as necessary. One caveat: do not run any management node on the same physical host as a data node. If that host fails, the data node and the arbitrator are lost at the same moment, and the surviving data nodes, unable to win arbitration, will shut themselves down rather than risk a split-brain.
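
A quick way to verify the failover: the ndb_mgm client accepts the same two-host connectstring, so it connects to whichever management node is currently alive:

  ndb_mgm --ndb-connectstring="192.168.0.31,192.168.0.32" -e "SHOW"

Stop ndb_mgmd on the first host and run the command again; it should answer via the second management node.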

Comments:

  1. Nice information. Can you tell me how the data nodes are configured here? Assume I have two data nodes; will both of them connect to both management nodes?

    Thanks

  2. The --ndb-connectstring option that the ndb data nodes are started with uses the same format as the ndb-connectstring option in my.cnf, e.g.: ndbd --ndb-connectstring="192.168.0.31,192.168.0.32"

  3. Thanks for your post. Can I ask you some questions about the cluster by email?

  4. Why does a cluster shutdown occur when the data node and the management node are on the same host, say a blade, and the full blade goes down? Is it because of a race condition? Can't the management node fail over first, so that the data node could talk to the second management node?
