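Part 0. Preparation
Our company recently upgraded its Zabbix server. The old single-host setup ran under heavy load, and whenever that host went down the whole monitoring system went down with it, so we took the opportunity to move to a two-node high-availability architecture.
Below is the process of building HA with keepalived, kept here for reference. The setup uses two hosts and three IP addresses; both hosts run zabbix and keepalived.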
IP address | Hostname | Notes |
---|---|---|
192.168.3.141 | zabbix_master | zabbix/keepalived |
192.168.3.142 | zabbix_slaver | zabbix/keepalived |
192.168.3.144 | VIP | --- |
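Part 1. Installing keepalived
1. Installation
On CentOS 7 installation is simple: just use yum.
    $ su root
    $ yum install keepalived
    $ /usr/sbin/keepalived -D
    $ pstree |grep keepalived
    |-keepalived---2*[keepalived]
You can see that three processes were actually started: one parent and two children.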
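2. Base Configuration
Set keepalived up to start on boot.
    # Stop the instance that was started by hand above
    pgrep keepalived |xargs kill -15
    # Reload the systemd unit files
    systemctl daemon-reload
    # Enable start on boot
    systemctl enable keepalived.service
    # Disable start on boot
    systemctl disable keepalived.service
    # Start
    systemctl start keepalived.service
    # Stop
    systemctl stop keepalived.service
    # Check status
    systemctl status keepalived.service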
3. Advanced Configuration
    su root
    chown zabbix:zabbix /usr/sbin/keepalived
    chown zabbix:zabbix /lib/systemd/system/keepalived.service
    chown zabbix:zabbix /etc/keepalived/keepalived.conf
    su zabbix
    vim /lib/systemd/system/keepalived.service
    vim /etc/keepalived/keepalived.conf
    chmod +x /home/zabbix/keepalived/src/zabbix_status_check2.sh
The two files that matter are the keepalived.conf configuration file and the zabbix_status_check2.sh script. There is plenty of material on both online, so I won't go into detail here.
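Since the configuration itself is not shown in this post, here is a minimal sketch of what keepalived.conf might look like, wrapped in a small shell function so the same template can serve both nodes. The instance name VI_1, the script name zabbix_status_check, the interface ens160 and the VIP 192.168.3.144/24 are taken from the logs later in this post; the function name render_keepalived_conf, virtual_router_id, auth_pass, interval, fall, rise and the priority values are assumptions for illustration only.

```shell
#!/bin/sh
# Sketch only: prints a minimal keepalived.conf for one node.
# VI_1, zabbix_status_check, ens160 and 192.168.3.144/24 come from the logs
# in this post. virtual_router_id, auth_pass, interval, fall and rise are
# assumed placeholder values, not the author's real settings.
render_keepalived_conf() {
    state="$1"      # initial state: MASTER or BACKUP
    priority="$2"   # VRRP priority; the node with the higher value wins
    cat <<EOF
vrrp_script zabbix_status_check {
    script "/home/zabbix/keepalived/src/zabbix_status_check2.sh"
    interval 5
    fall 2
    rise 1
}

vrrp_instance VI_1 {
    state ${state}
    interface ens160
    virtual_router_id 51
    priority ${priority}
    advert_int 1
    authentication {
        auth_type PASS
        auth_pass changeme
    }
    virtual_ipaddress {
        192.168.3.144/24
    }
    track_script {
        zabbix_status_check
    }
}
EOF
}
```

Call it with whatever state and priority each node should have, for example `render_keepalived_conf MASTER 100`, and redirect the output to /etc/keepalived/keepalived.conf. Both nodes must use the same virtual_router_id and authentication settings.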
Part 2. Starting both nodes
1. Start keepalived on the master
    Mar 13 17:27:57 vm1184 Keepalived_vrrp[5026]: VRRP_Script(zabbix_status_check) succeeded
    Mar 13 17:28:00 vm1184 Keepalived_vrrp[5026]: VRRP_Instance(VI_1) Transition to MASTER STATE
    Mar 13 17:28:01 vm1184 Keepalived_vrrp[5026]: VRRP_Instance(VI_1) Entering MASTER STATE
    Mar 13 17:28:01 vm1184 Keepalived_vrrp[5026]: VRRP_Instance(VI_1) setting protocol VIPs.
    Mar 13 17:28:01 vm1184 Keepalived_vrrp[5026]: VRRP_Instance(VI_1) Sending gratuitous ARPs on ens160 for 192.168.3.144
    Mar 13 17:28:01 vm1184 Keepalived_healthcheckers[5025]: Netlink reflector reports IP 192.168.3.144 added
    Mar 13 17:28:01 vm1184 avahi-daemon[637]: Registering new address record for 192.168.3.144 on ens160.IPv4.
    Mar 13 17:28:06 vm1184 Keepalived_vrrp[5026]: VRRP_Instance(VI_1) Sending gratuitous ARPs on ens160 for 192.168.3.144
2. Addresses on the master: the VIP is now bound to the network interface.
    $ ip a
    1: lo: mtu 65536 qdisc noqueue state UNKNOWN qlen 1
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host
           valid_lft forever preferred_lft forever
    2: ens160: mtu 1500 qdisc pfifo_fast state UP qlen 1000
        link/ether 00:50:56:94:eb:ad brd ff:ff:ff:ff:ff:ff
        inet 192.168.3.141/32 brd 192.168.3.141 scope global ens160
           valid_lft forever preferred_lft forever
        inet 192.168.3.144/24 scope global ens160
           valid_lft forever preferred_lft forever
        inet6 fe80::ff15:1cc9:5bd0:b06e/64 scope link
           valid_lft forever preferred_lft forever
3. Log on the backup: keepalived has started.
    $ tail -f /var/log/messages
    Mar 14 09:24:02 vm1185 Keepalived_healthcheckers[7588]: Registering Kernel netlink command channel
    Mar 14 09:24:02 vm1185 Keepalived_healthcheckers[7588]: Opening file '/etc/keepalived/keepalived.conf'.
    Mar 14 09:24:02 vm1185 Keepalived_healthcheckers[7588]: Configuration is using : 8087 Bytes
    Mar 14 09:24:02 vm1185 Keepalived_healthcheckers[7588]: Using LinkWatch kernel netlink reflector...
4. Addresses on the backup: the VIP is not bound to the network interface.
    $ ip a
    1: lo: mtu 65536 qdisc noqueue state UNKNOWN qlen 1
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host
           valid_lft forever preferred_lft forever
    2: ens160: mtu 1500 qdisc pfifo_fast state UP qlen 1000
        link/ether 00:50:56:94:88:c9 brd ff:ff:ff:ff:ff:ff
        inet 192.168.3.142/32 brd 192.168.3.142 scope global ens160
           valid_lft forever preferred_lft forever
        inet6 fe80::e00c:f722:cd30:ad8b/64 scope link
           valid_lft forever preferred_lft forever
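Reading the full `ip a` output on each node works, but the same check can be scripted. The sketch below is one possible way to do it: a small helper (the name holds_vip is made up for this example) that reads `ip` output on standard input and succeeds only if the given IPv4 address is bound.

```shell
#!/bin/sh
# Hypothetical helper: reads `ip addr` output on stdin and succeeds (exit 0)
# if the given IPv4 address appears as a bound address.
holds_vip() {
    vip="$1"
    # -F matches the text literally, so the dots are not regex wildcards.
    # The trailing slash keeps a query for 192.168.3.14 from matching the
    # line for 192.168.3.144.
    grep -Fq "inet ${vip}/"
}
```

On either node it could be used as `ip -4 addr show dev ens160 | holds_vip 192.168.3.144 && echo "this node holds the VIP"`. With the outputs above, the check succeeds on the master and fails on the backup.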
Part 3. Master/backup failover
1. After keepalived is killed on the master, the backup automatically takes over as master.
    Mar 14 09:29:04 vm1185 Keepalived_vrrp[7589]: VRRP_Instance(VI_1) Transition to MASTER STATE
    Mar 14 09:29:05 vm1185 Keepalived_vrrp[7589]: VRRP_Instance(VI_1) Entering MASTER STATE
    Mar 14 09:29:05 vm1185 Keepalived_vrrp[7589]: VRRP_Instance(VI_1) setting protocol VIPs.
    Mar 14 09:29:05 vm1185 Keepalived_vrrp[7589]: VRRP_Instance(VI_1) Sending gratuitous ARPs on ens160 for 192.168.3.144
    Mar 14 09:29:05 vm1185 Keepalived_healthcheckers[7588]: Netlink reflector reports IP 192.168.3.144 added
    Mar 14 09:29:05 vm1185 avahi-daemon[674]: Registering new address record for 192.168.3.144 on ens160.IPv4.
    Mar 14 09:29:10 vm1185 Keepalived_vrrp[7589]: VRRP_Instance(VI_1) Sending gratuitous ARPs on ens160 for 192.168.3.144
2. Addresses on the current master (the former backup): the VIP is now bound.
    $ ip a
    1: lo: mtu 65536 qdisc noqueue state UNKNOWN qlen 1
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host
           valid_lft forever preferred_lft forever
    2: ens160: mtu 1500 qdisc pfifo_fast state UP qlen 1000
        link/ether 00:50:56:94:88:c9 brd ff:ff:ff:ff:ff:ff
        inet 192.168.3.142/32 brd 192.168.3.142 scope global ens160
           valid_lft forever preferred_lft forever
        inet 192.168.3.144/24 scope global ens160
           valid_lft forever preferred_lft forever
        inet6 fe80::e00c:f722:cd30:ad8b/64 scope link
           valid_lft forever preferred_lft forever
3. Start keepalived on the original master again. It briefly enters MASTER state, then receives a higher-priority advertisement from the current master, drops to BACKUP, and removes its VIP binding.
    $ tail -f /var/log/messages
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) Entering MASTER STATE
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) setting protocol VIPs.
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) Sending gratuitous ARPs on ens160 for 192.168.3.144
    Mar 14 09:30:12 vm1184 avahi-daemon[637]: Registering new address record for 192.168.3.144 on ens160.IPv4.
    Mar 14 09:30:12 vm1184 Keepalived_healthcheckers[11856]: Netlink reflector reports IP 192.168.3.144 added
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) Received higher prio advert
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) Entering BACKUP STATE
    Mar 14 09:30:12 vm1184 Keepalived_vrrp[11857]: VRRP_Instance(VI_1) removing protocol VIPs.
    Mar 14 09:30:12 vm1184 Keepalived_healthcheckers[11856]: Netlink reflector reports IP 192.168.3.144 removed
    Mar 14 09:30:12 vm1184 avahi-daemon[637]: Withdrawing address record for 192.168.3.144 on ens160.
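The state changes are easy to lose among the ARP and avahi lines. As a rough sketch, the filter below keeps only the "Entering ... STATE" lines and prints timestamp, host, instance and new state. The function name vrrp_transitions is invented for this example, and the pattern assumes the syslog line format shown in this post.

```shell
#!/bin/sh
# Hypothetical filter: reads syslog lines on stdin and prints one line per
# VRRP state change as "timestamp host instance STATE".
# Assumes the classic 15-character syslog timestamp ("Mar 14 09:30:12").
vrrp_transitions() {
    sed -n -E 's/^(.{15}) ([^ ]+) Keepalived_vrrp\[[0-9]+\]: VRRP_Instance\(([^)]+)\) Entering ([A-Z]+) STATE$/\1 \2 \3 \4/p'
}
```

For example, `vrrp_transitions < /var/log/messages` on the original master would reduce the log above to one MASTER line followed by one BACKUP line for VI_1.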
4. Addresses on the original master, now the backup: the VIP binding has indeed been removed.
    $ ip a
    1: lo: mtu 65536 qdisc noqueue state UNKNOWN qlen 1
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host
           valid_lft forever preferred_lft forever
    2: ens160: mtu 1500 qdisc pfifo_fast state UP qlen 1000
        link/ether 00:50:56:94:eb:ad brd ff:ff:ff:ff:ff:ff
        inet 192.168.3.141/32 brd 192.168.3.141 scope global ens160
           valid_lft forever preferred_lft forever
        inet6 fe80::ff15:1cc9:5bd0:b06e/64 scope link
           valid_lft forever preferred_lft forever
Part 4. zabbix_server crash test
    # How the server was originally started
    $ ps -ef |grep zabbix_server
    zabbix 31668 31643 0 Mar13 ? 00:00:11 ./sbin/zabbix_server: escalator #1 [processed 0 escalations in 0.001148 sec, idle 3 sec]
    zabbix 31669 31643 0 Mar13 ? 00:00:03 ./sbin/zabbix_server: proxy poller #1 [exchanged data with 0 proxies in 0.000066 sec, idle 5 sec]
    zabbix 31670 31643 0 Mar13 ? 00:00:09 ./sbin/zabbix_server: self-monitoring [processed data in 0.000046 sec, idle 1 sec]
    zabbix 31671 31643 0 Mar13 ? 00:00:03 ./sbin/zabbix_server: task manager [processed 0 task(s) in 0.000289 sec, idle 5 sec]
    # Kill the processes to simulate a crash
    $ pgrep zabbix_server |xargs kill -15
    # Check
    $ ps -ef |grep zabbix_server
    zabbix 14053 14028 0 09:59 ? 00:00:00 /home/zabbix/zabbix/sbin/zabbix_server: history syncer #4 [synced 0 items in 0.000043 sec, idle 1 sec]
    zabbix 14054 14028 0 09:59 ? 00:00:00 /home/zabbix/zabbix/sbin/zabbix_server: escalator #1 [processed 0 escalations in 0.001202 sec, idle 3 sec]
    zabbix 14055 14028 0 09:59 ? 00:00:00 /home/zabbix/zabbix/sbin/zabbix_server: proxy poller #1 [exchanged data with 0 proxies in 0.000057 sec, idle 5 sec]
    zabbix 14056 14028 0 09:59 ? 00:00:00 /home/zabbix/zabbix/sbin/zabbix_server: self-monitoring [processed data in 0.000040 sec, idle 1 sec]
    zabbix 14057 14028 0 09:59 ? 00:00:00 /home/zabbix/zabbix/sbin/zabbix_server: task manager [processed 0 task(s) in 0.000433 sec, idle 5 sec]
The second listing shows new PIDs, a fresh start time, and the absolute path /home/zabbix/zabbix/sbin/zabbix_server. This confirms that keepalived's health check script really did run: when it finds no zabbix_server process, it restarts the server automatically.
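The real zabbix_status_check2.sh is not shown in this post, so the following is only a sketch of a check script with the behavior just described: look for a zabbix_server process, try one restart if none is found, and report the result through the exit status. The function names, the RESTART_WAIT delay and the assumption that the binary can be started without arguments are all mine; the binary path is the one visible in the ps output above.

```shell
#!/bin/sh
# Hypothetical stand-in for zabbix_status_check2.sh (the original is not shown).
# The path comes from the ps output in this post; starting the binary with no
# arguments and waiting RESTART_WAIT seconds are assumptions.
ZABBIX_SERVER_BIN="${ZABBIX_SERVER_BIN:-/home/zabbix/zabbix/sbin/zabbix_server}"
RESTART_WAIT="${RESTART_WAIT:-3}"

# Succeeds if at least one zabbix_server process exists.
is_running() {
    pgrep zabbix_server >/dev/null 2>&1
}

# Returns 0 if the server is up (possibly after one restart attempt),
# non-zero if it is still down afterwards.
check_and_restart() {
    if is_running; then
        return 0
    fi
    # No process found: try to start the server, then check once more.
    "$ZABBIX_SERVER_BIN" >/dev/null 2>&1
    sleep "$RESTART_WAIT"
    is_running
}
```

A script built on this would end with a call to `check_and_restart`, so that its exit status becomes the script's exit status. keepalived treats a non-zero exit from a tracked script as a failed check, which is what lets the VIP move to the other node when the restart does not succeed.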