{"id":1300,"date":"2012-09-03T14:29:28","date_gmt":"2012-09-03T06:29:28","guid":{"rendered":"http:\/\/rmohan.com\/?p=1300"},"modified":"2012-09-03T14:29:28","modified_gmt":"2012-09-03T06:29:28","slug":"redhat-cluster-howto","status":"publish","type":"post","link":"https:\/\/mohan.sg\/?p=1300","title":{"rendered":"RedHat Cluster howto"},"content":{"rendered":"<div>\n<h1>Introduction<\/h1>\n<p>Here I wrote up a little tutorial how to configure a standard RHEL cluster. Configuring a RHEL cluster is quite easy but documentation is sparse and not well organized. We will configure a 4 nodes cluster with shared storage and Heatbeat over a different NIC (not the main data link).<\/p>\n<p><a name=\"Cluster_configuration_goals\"><\/a><\/p>\n<h2>Cluster configuration goals<\/h2>\n<ul>\n<li>Shared storage<\/li>\n<li>HA-LVM: lvm failover configuration (like HP ServiceGuard) is different from clustered logical volume manager (clvm)!!<\/li>\n<li>Bonded main data link (eg. bond0 \u2013&gt; eth0 + eth1)<\/li>\n<li>Hearthbeat on a different data link (eg. eth2)<\/li>\n<\/ul>\n<p><a name=\"Cluster_installation_steps\"><\/a><\/p>\n<h1>Cluster installation steps<\/h1>\n<p><a name=\"OS_installation\"><\/a><\/p>\n<h2>OS installation<\/h2>\n<p>First we performed a full CentOS 5.5 installation using kickstart, we also installed cluster packages like:<\/p>\n<ul>\n<li>cman<\/li>\n<li>rgmanager<\/li>\n<li>qdiskd<\/li>\n<li>ccs_tools<\/li>\n<\/ul>\n<p>or<\/p>\n<ul>\n<li>@clustering (kickstart group)<\/li>\n<\/ul>\n<p><a name=\"Networking_configuration\"><\/a><\/p>\n<h2>Networking configuration<\/h2>\n<p>We configure 2 different data link:<\/p>\n<ol>\n<li>Main data link (for applications)<\/li>\n<li>Heartbeat data link (for cluster communication)<\/li>\n<\/ol>\n<p>Main data link (bond0) uses ethernet bonding over 2 phisycal eth (eth0, eth1). This configuration assures network high availability when some network paths fail.<\/p>\n<p>Cluster communication (heartbeat) uses a dedicated ethernet link (eth2), configured in a diffentent network and vlan.<\/p>\n<p>To obtain such configuration cerate this file\u00a0<code>\/etc\/sysconfig\/network-scripts\/ifcfg-bond0<\/code> from scratch and fill it as below:<\/p>\n<pre>DEVICE=bond0\r\nIPADDR=&lt;your server main IP address (eg. 10.200.56.41)&gt;\r\nNETMASK=&lt;your server main network mask (eg. 255.255.255.0)&gt;\r\nNETWORK=&lt;your server main network (eg. 10.200.56.0)&gt;\r\nBROADCAST=&lt;your server main network broadcast (eg. 10.200.56.255)&gt;\r\nONBOOT=yes\r\nBOOTPROTO=none\r\nUSERCTL=no\r\nBONDING_OPTS='miimon=100 mode=1'\r\nGATEWAY=&lt;your server main default gateway (eg. 10.200.56.1)&gt;\r\nTYPE=Ethernet<\/pre>\n<p>You can customize\u00a0<code>BONDING_OPT<\/code>. Please see bonding documentation.<\/p>\n<p>Modify\u00a0<code>\/etc\/sysconfig\/network-scripts\/ifcfg-eth{0,1}<\/code>:<\/p>\n<pre>DEVICE=&lt;eth0 or eth1, etc...&gt;\r\nUSECTL=no\r\nBOOTPROTO=none\r\nMASTER=bond0\r\nSLAVE=yes\r\nHWADDR=&lt;your eth MAC address (eg. 00:23:7d:3c:18:40)&gt;\r\nONBOOT=yes\r\nTYPE=Ethernet<\/pre>\n<p>Modify heartbeat nic\u00a0<code>\/etc\/sysconfig\/network-scripts\/ifcfg-eth2<\/code>:<\/p>\n<pre>DEVICE=eth2\r\nHWADDR=&lt;your eth MAC address (eg. 00:23:7D:3C:CE:96)&gt;\r\nONBOOT=yes\r\nBOOTPROTO=none\r\nTYPE=Ethernet\r\nNETMASK=&lt;your server heartbeat network mask (eg. 255.255.255.0)&gt;\r\nIPADDR=&lt;your server main IP address (eg. 192.168.133.41)&gt;<\/pre>\n<p>Note that heartbeat\u00a0<code>eth2<\/code> has no default gateway configured. Normally this is not required unless this node is outside other node\u2019s network and there are not specific static routes.<\/p>\n<p>Add this line to\u00a0<code>\/etc\/modprobe.conf<\/code>:<\/p>\n<pre>alias bond0 bonding<\/pre>\n<p>Add to\u00a0<code>\/etc\/hosts<\/code> the informations about\u00a0<strong>each<\/strong> cluster node and replicate the file among the nodes:<\/p>\n<pre># These are example!!!\r\n10.200.56.41            artu.yourdomain.com artu\r\n192.168.133.41          h-artu.yourdomain.com h-artu\r\n\r\n10.200.56.42            ginevra.yourdomain.com ginevra\r\n192.168.133.42          h-ginevra.yourdomain.com h-ginevra\r\n\r\n10.200.56.43            morgana.yourdomain.com morgana\r\n192.168.133.43          h-morgana.yourdomain.com h-morgana\r\n\r\n10.200.56.44            lancelot.yourdomain.com lancelot\r\n192.168.133.44          h-lancelot.yourdomain.com h-lancelot<\/pre>\n<p><a name=\"Logical_Volume_Manager_configuration\"><\/a><\/p>\n<h2>Logical Volume Manager configuration<\/h2>\n<p>We choose not to use clustered logical volume manager (clvmd, sometimes called LVMFailover) but to use HA-LVM instead. HA-LVM is totally different from clvmd and it is quite similar di HP ServiceGuard behaviour.<\/p>\n<p><a name=\"HA-LVM_features\"><\/a><\/p>\n<h3>HA-LVM features<\/h3>\n<ul>\n<li>No needs to run any daemon (like clvmd aka LVMFailover)<\/li>\n<li>Each volume group can be activated exclusively on one node at a time<\/li>\n<li>Volume group configuration is not replicated automatically among the nodes (need to run vgscan on the nodes)<\/li>\n<li>Implementation not dipendent of the cluster status (can work without cluster running at all)<\/li>\n<\/ul>\n<p><a name=\"HA-LVM_howto\"><\/a><\/p>\n<h3>HA-LVM howto<\/h3>\n<p>Configure\u00a0<code>\/etc\/lvm\/lvm.conf<\/code> as below:<\/p>\n<p>Substitute existing\u00a0<code>filter<\/code> with:<\/p>\n<pre>filter = [ \"a\/dev\/mpath\/.*\/\", \"a\/c[0-9]d[0-9]p[0-9]$\/\", \"a\/sd*\/\", \"r\/.*\/\" ]<\/pre>\n<p>check\u00a0<code>locking_type<\/code>:<\/p>\n<pre>locking_type = 1<\/pre>\n<p>substitute existing\u00a0<code>volume_list<\/code> with:<\/p>\n<pre>volume_list = [ \"vg00\", \"&lt;quorum disk volume group&gt;\", \"@&lt;hostname related to heartbeat nic&gt;\" ]<\/pre>\n<p>Where:<\/p>\n<ul>\n<li>vg00 is the name of the root volume group (always active)<\/li>\n<li>&lt;quorum disk volume group&gt; is the name of the quorum disk volume group (always active)<\/li>\n<li>@&lt;hostname related to heartbeat nic&gt; is a tag. Each volume group can have one tag at a time. Cluster lvm agents tag the volume groups with the hostname (present into configuration) in order to activate them. LVM activate only volume groups that contain such tag. In this way each volume group tagged can be activated and accessed by one node at a time (because of volume_list settings)<\/li>\n<\/ul>\n<p>At the end remember to regenerate initrd!<\/p>\n<pre># mkinitrd -f \/boot\/initrd-$(uname -r).img $(uname -r)<\/pre>\n<p><a name=\"Storage_configuration\"><\/a><\/p>\n<h2>Storage configuration<\/h2>\n<p>Depending of your storage system, you should configure multipath, and\u00a0<strong>each<\/strong> should be able to access to the same luns.<\/p>\n<p><a name=\"Quorum_disk\"><\/a><\/p>\n<h2>Quorum disk<\/h2>\n<p>Quorum disk is a 20MB LUN shared on the storage to all cluster nodes. This disk is used by the cluster to tie-break in case of split-brain events. Each node update its own information to the quorum disk. If some nodes experience network problems, the quorum disk assures that only the\u00a0<strong>right<\/strong> group of nodes form the cluster but\u00a0<strong>not<\/strong> both (split-brain)!<\/p>\n<p><a name=\"Quorum_disk_creation\"><\/a><\/p>\n<h3>Quorum disk creation<\/h3>\n<p>First be sure that\u00a0<strong>each<\/strong> node can see the same 20MB LUN. Then, on the first node, create a physical volume:<\/p>\n<pre># pvcreate \/dev\/mpath1<\/pre>\n<p>create a dedicated volume group:<\/p>\n<pre># vgcreate -s 8 vg_qdisk \/dev\/mpath1<\/pre>\n<p>create a logical volume and extend it to maximun volume group size:<\/p>\n<pre># lvcreate -l &lt;max_vg_pe&gt; -n lv_qdisk vg_qdisk<\/pre>\n<p><strong>Make sure that this volume group is present into\u00a0<code>volume_list<\/code> inside\u00a0<code>\/etc\/lvm\/lvm.conf<\/code>. It should be activated on all nodes!<\/strong><\/p>\n<p>On the other nodes perform a:<\/p>\n<pre># vgscan<\/pre>\n<p>Should appear the quorum disk volume group.<\/p>\n<p><a name=\"Quorum_disk_configuration\"><\/a><\/p>\n<h3>Quorum disk configuration<\/h3>\n<p>Now we have to populate quorum disk space with the right information. To perform this type:<\/p>\n<pre># mkqdisk -c \/dev\/vg_qdisk\/lv_qdisk -l &lt;your_cluster_name&gt;<\/pre>\n<p>Note that is not required to use your cluster name as quorum disk label, but it is recommended.<\/p>\n<p>You need also to create a heuristic script to help qdisk when acting as tie-breaker. Create\u00a0<code>\/usr\/share\/cluster\/check_eth_link.sh<\/code>:<\/p>\n<pre>#!\/bin\/sh\r\n# Network link status checker\r\n\r\nethtool $1 | grep -q \"Link detected.*yes\"\r\nexit $?<\/pre>\n<p>Now activate the quorum disk:<\/p>\n<pre># service qdiskd start\r\n# chkconfig qdiskd on<\/pre>\n<p><a name=\"Logging_configuration\"><\/a><\/p>\n<h2>Logging configuration<\/h2>\n<p>In order to assure a good logging you can choose to log the rgmanager to a specific file.<\/p>\n<p>Add this lines to\u00a0<code>\/etc\/syslog.conf<\/code>:<\/p>\n<pre># Red Hat Cluster\r\nlocal4.* \/var\/log\/rgmanager<\/pre>\n<p>Add\u00a0<code>\/var\/log\/rgmanager<\/code> to logrotate syslog settings in\u00a0<code>\/etc\/logrotate.d\/syslog<\/code>:<\/p>\n<pre>\/var\/log\/messages \/var\/log\/secure \/var\/log\/maillog \/var\/log\/spooler \/var\/log\/boot.log \/var\/log\/cron \/var\/log\/rgmanager {\r\n    sharedscripts\r\n    postrotate\r\n        \/bin\/kill -HUP `cat \/var\/run\/syslogd.pid 2&gt; \/dev\/null` 2&gt; \/dev\/null || true\r\n        \/bin\/kill -HUP `cat \/var\/run\/rsyslogd.pid 2&gt; \/dev\/null` 2&gt; \/dev\/null || true\r\n    endscript\r\n}<\/pre>\n<p>Modify this line in\u00a0<code>\/etc\/cluster\/cluster.conf<\/code>:<\/p>\n<pre>&lt;rm log_facility=\"local4\" log_level=\"5\"&gt;<\/pre>\n<p>Increment\u00a0<code>\/etc\/cluster\/cluster.conf<\/code> version and update on all nodes:<\/p>\n<pre># ccs_tool update \/etc\/cluster\/cluster.conf<\/pre>\n<p><a name=\"Cluster_configuration\"><\/a><\/p>\n<h2>Cluster configuration<\/h2>\n<p>For configuring cluster you can choose to use:<\/p>\n<ul>\n<li>Luci web interface<\/li>\n<li>Manual xml configuration<\/li>\n<\/ul>\n<p><a name=\"Configuring_cluster_using_luci\"><\/a><\/p>\n<h3>Configuring cluster using luci<\/h3>\n<p>In order to use luci web interface you need to activate service\u00a0<code>ricci<\/code> on\u00a0<strong>all<\/strong> nodes and\u00a0<code>luci<\/code> on one node only:<\/p>\n<pre>(on all nodes)\r\n# chkconfig ricci on\r\n# service ricci start<\/pre>\n<pre>(choose only a node)\r\n# chkconfig luci on\r\n# luci_admin init\r\n# service luci restart<\/pre>\n<p>Please note that\u00a0<code>luci_admin init<\/code> must be executed only the first time and\u00a0<strong>before<\/strong> starting luci service, otherwise luci will be unusable.<\/p>\n<p>now connect to luci:\u00a0<a title=\"https:\/\/node_with_luci.mydomain.com:8084\" href=\"https:\/\/node_with_luci.mydomain.com:8084\/\" rel=\"nofollow\">https:\/\/node_with_luci.mydomain.com:8084<\/a> Here you can create a cluster, add nodes, create services, failover domains etc\u2026<\/p>\n<p>See\u00a0<a title=\"Recommended cluster configuration\" href=\"http:\/\/www.nxnt.org\/2010\/09\/redhat-cluster-howto\/#Recommended_cluster_configuration\">Recommended cluster configuration<\/a> to learn the right settings for the cluster.<\/p>\n<p><a name=\"Configuring_cluster_editing_the_XML\"><\/a><\/p>\n<h3>Configuring cluster editing the XML<\/h3>\n<p>You can also manually configure a cluster editing its main config file\u00a0<code>\/etc\/cluster\/cluster.conf<\/code>. To create the config skeleton use:<\/p>\n<pre># ccs_tool create<\/pre>\n<p>now the just created config file is not yet usable, you should configure cluster settings, add nodes, create services, failover domains etc\u2026<\/p>\n<p>When config file is complete,\u00a0<strong>copy the file on all nodes<\/strong> and start the cluster in this way:<\/p>\n<pre>(on all nodes)\r\n# chkconfig cman on\r\n# chkconfig rgmanager on\r\n# service cman start\r\n# service rgmanager start<\/pre>\n<p>See\u00a0<a title=\"Recommended cluster configuration\" href=\"http:\/\/www.nxnt.org\/2010\/09\/redhat-cluster-howto\/#Recommended_cluster_configuration\">Recommended cluster configuration<\/a> to learn the right settings for the cluster.<\/p>\n<p>See\u00a0<a title=\"Useful cluster commands\" href=\"http:\/\/www.nxnt.org\/2010\/09\/redhat-cluster-howto\/#Useful_cluster_commands\">Useful cluster commands<\/a> to learn some useful console cluster commands to use.<\/p>\n<p><a name=\"Configuring_cluster_editing_the_XML\"><\/a><br \/> <a name=\"Recommended_cluster_configuration\"><\/a><\/p>\n<h3>Recommended cluster configuration<\/h3>\n<p>Here is attached a\u00a0<code>\/etc\/cluster\/cluster.conf<\/code> file of a fully configured cluster.<\/p>\n<p>For commenting purposes, the file is splitted into several consecutive parts:<\/p>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n2\r\n3\r\n4\r\n5\r\n6\r\n7\r\n8\r\n9\r\n10\r\n11\r\n12\r\n13\r\n14\r\n15\r\n16\r\n<\/pre>\n<\/td>\n<td>\n<pre>&lt;?xml version=\"1.0\"?&gt;\r\n&lt;cluster alias=\"jcaps_prd\" config_version=\"26\" name=\"jcaps_prd\"&gt;\r\n        &lt;fence_daemon clean_start=\"0\" post_fail_delay=\"0\" post_join_delay=\"3\"\/&gt;\r\n        &lt;clusternodes&gt;\r\n                &lt;clusternode name=\"h-lancelot.yourdomain.com\" nodeid=\"1\" votes=\"1\"&gt;\r\n                        &lt;fence\/&gt;\r\n                &lt;\/clusternode&gt;\r\n                &lt;clusternode name=\"h-artu.yourdomain.com\" nodeid=\"2\" votes=\"1\"&gt;\r\n                        &lt;fence\/&gt;\r\n                &lt;\/clusternode&gt;\r\n                &lt;clusternode name=\"h-morgana.yourdomain.com\" nodeid=\"3\" votes=\"1\"&gt;\r\n                        &lt;fence\/&gt;\r\n                &lt;\/clusternode&gt;\r\n        &lt;\/clusternodes&gt;\r\n        &lt;cman expected_votes=\"4\"\/&gt;\r\n        &lt;fencedevices\/&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This is the first part of the XML cluster config file.<\/p>\n<ul>\n<li>First line describes the cluster name and the\u00a0<code>config_version<\/code>. Each time you modify the XML you must increment the config_version by 1 prior to update the config on all nodes.<\/li>\n<li>Fence deamon line is the default one.<\/li>\n<li>Cluster node stanza contains the nodes of the cluster. Note that\u00a0<code>name<\/code> property contains the FQDN of the name. This name determines the eth used for cluster communication. In this example we don\u2019t use the main hostname but the hostname related to the eth we choose to use as cluster communication channel.<\/li>\n<li>Note also that the line\u00a0<code>&lt;fence\/&gt;<\/code> is required. Note that here we\u00a0<strong>do not<\/strong> use any fence device. Due to the nature of HA-LVM the access to the data sould be exclusive by one node at a time.<\/li>\n<li>Cman\u00a0<code>expected_votes<\/code> is 4 because each node give 1 vote each.<\/li>\n<\/ul>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n2\r\n3\r\n4\r\n5\r\n6\r\n7\r\n8\r\n9\r\n<\/pre>\n<\/td>\n<td>\n<pre>        &lt;rm log_facility=\"local4\" log_level=\"5\"&gt;\r\n                &lt;failoverdomains&gt;\r\n                        &lt;failoverdomain name=\"jcaps_prd\" nofailback=\"0\" ordered=\"0\" restricted=\"1\"&gt;\r\n                                &lt;failoverdomainnode name=\"h-lancelot.yourdomain.com\" priority=\"1\"\/&gt;\r\n                                &lt;failoverdomainnode name=\"h-artu.yourdomain.com\" priority=\"1\"\/&gt;\r\n                                &lt;failoverdomainnode name=\"h-morgana.yourdomain.com\" priority=\"1\"\/&gt;\r\n                        &lt;\/failoverdomain&gt;\r\n                &lt;\/failoverdomains&gt;\r\n                &lt;resources\/&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This section\u00a0<strong>begins<\/strong> resource manager configuration (<code>&lt;rm ...&gt;<\/code>).<\/p>\n<ul>\n<li>Resource manager section can be configured for logging. Rm logs to syslog, here we configured the\u00a0<code>log_facility<\/code> and the logging level. The facility we specified allows us to log to a separate file (see\u00a0<a title=\"Logging configuration\" href=\"http:\/\/www.nxnt.org\/2010\/09\/redhat-cluster-howto\/#Logging_configuration\">logging configuration<\/a>)<\/li>\n<li>We configured also a failover domain containing all cluster node. We want that a service can switch to all cluster nodes, but you can also configure different behaviours here.<\/li>\n<\/ul>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n2\r\n3\r\n4\r\n5\r\n6\r\n7\r\n8\r\n9\r\n<\/pre>\n<\/td>\n<td>\n<pre>        &lt;service autostart=\"1\" domain=\"jcaps_prd\" exclusive=\"0\" name=\"subversion\" recovery=\"relocate\"&gt;\r\n                &lt;ip address=\"10.200.56.60\" monitor_link=\"1\"\/&gt;\r\n                &lt;lvm name=\"vg_subversion_apps\" vg_name=\"vg_subversion_apps\"\/&gt;\r\n                &lt;lvm name=\"vg_subversion_data\" vg_name=\"vg_subversion_data\"\/&gt;\r\n                &lt;fs device=\"\/dev\/vg_subversion_apps\/lv_apps\" force_fsck=\"1\" force_unmount=\"1\" fsid=\"61039\" fstype=\"ext3\" mountpoint=\"\/apps\/subversion\" name=\"svn_apps\" self_fence=\"0\"&gt;\r\n                    &lt;fs device=\"\/dev\/vg_subversion_data\/lv_repositories\" force_fsck=\"1\" force_unmount=\"1\" fsid=\"3193\" fstype=\"ext3\" mountpoint=\"\/apps\/subversion\/repositories\" name=\"svn_repositories\" self_fence=\"0\"\/&gt;\r\n                &lt;\/fs&gt;\r\n                &lt;script file=\"\/my_cluster_scripts\/subversion\/subversion.sh\" name=\"subversion\"\/&gt;\r\n        &lt;\/service&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This section contains the services in the cluster (like HP ServiceGuard packages)<\/p>\n<ul>\n<li>We choose the failover domain (in this case our failover domain contains all nodes so the service can run on all nodes)<\/li>\n<li>We add a ip address resource (use always monitor link!)<\/li>\n<li>We use also a HA-LVM resource (<code>&lt;lvm ...&gt;<\/code>). Here all VG specified will be tagged with the node name when activating. This means that they can be activated\u00a0<strong>only<\/strong> on the node where the service is running (only on that node!).\u00a0<strong>Note: If you do not specify any LV, all the LVs inside the VG will be activated!<\/strong><\/li>\n<li>Next there are also\u00a0<code>&lt;fs ...&gt;<\/code> tags for mounting filesystem resources. It is recommended to use\u00a0<code>force_unmount<\/code> and\u00a0<code>force_fsck<\/code>.<\/li>\n<li>You can specify also a custom script for starting application\/services and so on. Please note that the script must be LSB compliant. This means that it must handle\u00a0<em>start|stop|status<\/em>. Note also that default cluster behaviour is to run the script with\u00a0<em>status<\/em> parameter every 30 seconds. If the script status does not return 0, the service will be marked as failed (and probably will be restarted\/relocated).<\/li>\n<\/ul>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n<\/pre>\n<\/td>\n<td>\n<pre>        &lt;\/rm&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This section closes the resource manager configuration (closes XML tag).<\/p>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n<\/pre>\n<\/td>\n<td>\n<pre>        &lt;totem consensus=\"4800\" join=\"60\" token=\"20000\" token_retransmits_before_loss_const=\"20\"\/&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This is a crucial part of cluster configuration. Here you specify the failure detection time of cluster.<\/p>\n<ul>\n<li>RedHat recommends to the CMAN membership (token) timeout value to be at least times that of the qdiskd timeout value. Here the value is 20 seconds.<\/li>\n<\/ul>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n2\r\n3\r\n<\/pre>\n<\/td>\n<td>\n<pre>        &lt;quorumd interval=\"2\" label=\"jcaps_prd_qdisk\" min_score=\"2\" tko=\"5\" votes=\"1\"&gt;\r\n                &lt;heuristic interval=\"2\" program=\"\/usr\/share\/cluster\/check_eth_link.sh bond0\" score=\"3\"\/&gt;\r\n        &lt;\/quorumd&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>Here we configure the quorum disk to be used by the cluster.<\/p>\n<ul>\n<li>We choose a quorum timeout value of 10 seconds (quorumd interval * quorumd tko) which is a half of token timeout (20 seconds).<\/li>\n<li>We insert also a heuristic script to determine the network health. This will help qdisk to take a decision when split-brain happens.<\/li>\n<\/ul>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<pre>1\r\n<\/pre>\n<\/td>\n<td>\n<pre>&lt;\/cluster&gt;<\/pre>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>This concludes the configuration file closing XML tags still opened.<\/p>\n<p><a name=\"Useful_cluster_commands\"><\/a><\/p>\n<h3>Useful cluster commands<\/h3>\n<ul>\n<li>ccs_tool update \/etc\/cluster\/cluster.conf (update cluster.conf among all nodes)<\/li>\n<li>clustat (see cluster status)<\/li>\n<li>clusvcadm -e &lt;service&gt; (enable\/start a service)<\/li>\n<li>clusvcadm -d &lt;service&gt; (disable\/stop service)<\/li>\n<li>vgs -o vg_name,vg_size,vg_tags (show all volume groups names, size and tags)<\/li>\n<\/ul>\n<p><a name=\"Resources\"><\/a><\/p>\n<h1>Resources<\/h1>\n<ul>\n<li>RedHat Cluster Suite developer wiki:\u00a0<a title=\"http:\/\/sources.redhat.com\/cluster\/wiki\" href=\"http:\/\/sources.redhat.com\/cluster\/wiki\" rel=\"nofollow\">http:\/\/sources.redhat.com\/cluster\/wiki<\/a><\/li>\n<li>RHCS Configuration guide:\u00a0<a title=\"http:\/\/www.redhat.com\/docs\/manuals\/csgfs\" href=\"http:\/\/www.redhat.com\/docs\/manuals\/csgfs\" rel=\"nofollow\">http:\/\/www.redhat.com\/docs\/manuals\/csgfs<\/a><\/li>\n<li>Migrating HP Serviceguard to RedHat Cluster Suite:\u00a0<a title=\"http:\/\/docs.hp.com\" href=\"http:\/\/docs.hp.com\/\" rel=\"nofollow\">4AA1-xxxxENN (May 2009)<\/a><\/li>\n<\/ul>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p> Introduction <\/p>\n<p>Here I wrote up a little tutorial how to configure a standard RHEL cluster. Configuring a RHEL cluster is quite easy but documentation is sparse and not well organized. We will configure a 4 nodes cluster with shared storage and Heatbeat over a different NIC (not the main data link).<\/p>\n<\/p>\n<p> Cluster configuration [&#8230;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[36],"tags":[],"_links":{"self":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/1300"}],"collection":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1300"}],"version-history":[{"count":2,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/1300\/revisions"}],"predecessor-version":[{"id":1302,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/1300\/revisions\/1302"}],"wp:attachment":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1300"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1300"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1300"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}