ZooKeeper Cluster 架构文档
集群标识
| 属性 |
值 |
| 集群名称 |
zookeeper-e93aadcf |
| 命名空间 |
qfusion-admin |
1. 环境信息
1.1 Kubernetes 集群
| 属性 |
值 |
| KUBECONFIG |
/bpx/.145-admin.conf |
| Kubernetes 版本 |
v1.24.10 |
| 网络插件 |
Cilium CNI |
| 容器运行时 |
containerd://1.7.3 |
1.2 工作节点分布
| 节点名称 |
IP 地址 |
操作系统 |
角色 |
ZooKeeper Pod |
| qfusion1 |
x.x.x.145 |
openEuler 22.03 (LTS-SP1) |
control-plane,master |
zookeeper-e93aadcf-0-0 |
| qfusion2 |
x.x.x.148 |
openEuler 22.03 (LTS-SP1) |
control-plane,master |
zookeeper-e93aadcf-observer-0-0 |
| qfusion3 |
x.x.x.150 |
openEuler 22.03 LTS |
control-plane,master |
zookeeper-e93aadcf-2-0 |
| qfusion4 |
x.x.x.147 |
Kylin Linux V10 (Halberd) |
worker |
zookeeper-e93aadcf-1-0 |
2. 集群架构概览
2.1 集群拓扑图
┌─────────────────────────────────────────────────────────────────────────────────┐
│ ZooKeeper Cluster: zookeeper-e93aadcf │
│ 版本: 3.7.1 │
├─────────────────────────────────────────────────────────────────────────────────┤
│ │
│ ┌─────────────────────────────────────────────────────────────────────────┐ │
│ │ Quorum 集群 (3节点) │ │
│ ├─────────────────────────────────────────────────────────────────────────┤ │
│ │ │ │
│ │ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │ │
│ │ │ Replica 0 │ │ Replica 1 │ │ Replica 2 │ │ │
│ │ │ Participant │ │ Participant │ │ Participant │ │ │
│ │ │ │ │ │ │ │ │ │
│ │ │ Node: qfusion1 │ │ Node: qfusion4 │ │ Node: qfusion3 │ │ │
│ │ │ IP: 245.0.0.94 │ │ IP: 245.0.3.243 │ │ IP: 245.0.2.35 │ │ │
│ │ │ myid: 0 │ │ myid: 1 │ │ myid: 2 │ │ │
│ │ └────────┬─────────┘ └────────┬─────────┘ └────────┬─────────┘ │ │
│ │ │ │ │ │ │
│ │ └───────────────────────┼───────────────────────┘ │ │
│ │ │ │ │
│ │ ┌───────────┴───────────┐ │ │
│ │ │ Leader Election │ │ │
│ │ │ (Port 3888) │ │ │
│ │ └───────────────────────┘ │ │
│ └─────────────────────────────────────────────────────────────────────────┘ │
│ │ │
│ ▼ │
│ ┌─────────────────────────────────────────────────────────────────────────┐ │
│ │ Observer 节点 (只读副本) │ │
│ ├─────────────────────────────────────────────────────────────────────────┤ │
│ │ ┌──────────────────┐ │ │
│ │ │ Observer-0 │ │ │
│ │ │ │ │ │
│ │ │ Node: qfusion2 │ │ │
│ │ │ IP: 245.0.1.95 │ │ │
│ │ │ myid: 3 │ │ │
│ │ └──────────────────┘ │ │
│ └─────────────────────────────────────────────────────────────────────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────────────────┘
2.2 集群规模统计
| 指标 |
值 |
| Participant 副本数 |
3 |
| Observer 副本数 |
1 |
| 总 Pod 数 |
4 |
| 总容器数 |
8 (每Pod 2个容器) |
| 存储总容量 |
80Gi (4 × 20Gi) |


Sources
3. Pod 详细信息
3.1 Participant 节点
Replica 0
| 属性 |
值 |
| Pod 名称 |
zookeeper-e93aadcf-0-0 |
| 命名空间 |
qfusion-admin |
| 角色 |
Participant (投票节点) |
| StatefulSet |
zookeeper-e93aadcf-0 |
| 节点 |
qfusion1 (x.x.x.145) |
| Pod IP |
245.0.0.94 |
| FQDN |
zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| 状态 |
Running |
| Ready |
2/2 |
| 重启次数 |
0 |
| 创建时间 |
2025-12-24 22:07:29 |
| myid |
0 |
| Zone |
zone1 |
Replica 1
| 属性 |
值 |
| Pod 名称 |
zookeeper-e93aadcf-1-0 |
| 命名空间 |
qfusion-admin |
| 角色 |
Participant (投票节点) |
| StatefulSet |
zookeeper-e93aadcf-1 |
| 节点 |
qfusion4 (x.x.x.147) |
| Pod IP |
245.0.3.243 |
| FQDN |
zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| 状态 |
Running |
| Ready |
2/2 |
| 重启次数 |
2 |
| 创建时间 |
2025-12-24 22:07:49 |
| myid |
1 |
Replica 2
| 属性 |
值 |
| Pod 名称 |
zookeeper-e93aadcf-2-0 |
| 命名空间 |
qfusion-admin |
| 角色 |
Participant (投票节点) |
| StatefulSet |
zookeeper-e93aadcf-2 |
| 节点 |
qfusion3 (x.x.x.150) |
| Pod IP |
245.0.2.35 |
| FQDN |
zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| 状态 |
Running |
| Ready |
2/2 |
| 重启次数 |
0 |
| 创建时间 |
2025-12-24 22:08:41 |
| myid |
2 |
3.2 Observer 节点
| 属性 |
值 |
| Pod 名称 |
zookeeper-e93aadcf-observer-0-0 |
| 命名空间 |
qfusion-admin |
| 角色 |
Observer (只读节点,不参与投票) |
| StatefulSet |
zookeeper-e93aadcf-observer-0 |
| 节点 |
qfusion2 (x.x.x.148) |
| Pod IP |
245.0.1.95 |
| FQDN |
zookeeper-e93aadcf-observer-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| 状态 |
Running |
| Ready |
2/2 |
| 重启次数 |
0 |
| 创建时间 |
2025-12-24 22:09:15 |
| myid |
3 |
4. 容器架构
4.1 Pod 容器组成
每个 ZooKeeper Pod 包含 2 个容器:
┌─────────────────────────────────────────────────────────────────────────┐
│ Pod │
├─────────────────────────────────────────────────────────────────────────┤
│ │
│ ┌───────────────────────────────────────────────────────────────────┐ │
│ │ Container 1: zookeeper (主容器) │ │
│ ├───────────────────────────────────────────────────────────────────┤ │
│ │ Image: k8smaster.qfusion.irds/irds/zookeeper-3.7:v1.1.0 │ │
│ │ Image ID: sha256:15681693... │ │
│ │ Command: /usr/local/bin/zookeeperStart.sh │ │
│ │ Pull Policy: Always │ │
│ │ │ │
│ │ Resources: │ │
│ │ CPU Request: 500m │ │
│ │ CPU Limit: 1000m │ │
│ │ Memory Request: 2Gi │ │
│ │ Memory Limit: 2Gi │ │
│ │ │ │
│ │ Ports: │ │
│ │ 2181/tcp - client (客户端连接) │ │
│ │ 2888/tcp - quorum (集群通信) │ │
│ │ 3888/tcp - leader-election (领导者选举) │ │
│ │ 7000/tcp - metrics (Prometheus监控指标) │ │
│ │ 8080/tcp - admin-server (Jetty管理服务器) │ │
│ │ │ │
│ │ Health Checks: │ │
│ │ Liveness: exec [zookeeperLive.sh] │ │
│ │ delay=10s, timeout=10s, period=10s │ │
│ │ Readiness: exec [zookeeperReady.sh] │ │
│ │ delay=10s, timeout=10s, period=10s │ │
│ │ │ │
│ │ Lifecycle: │ │
│ │ PreStop: exec [zookeeperTeardown.sh] │ │
│ │ │ │
│ │ Volume Mounts: │ │
│ │ /conf - ConfigMap (zookeeper-e93aadcf-configmap) │ │
│ │ /data - PVC (持久化数据) │ │
│ │ /mnt/log - EmptyDir (日志) │ │
│ │ /etc/localtime - HostPath (时区) │ │
│ └───────────────────────────────────────────────────────────────────┘ │
│ │
│ ┌───────────────────────────────────────────────────────────────────┐ │
│ │ Container 2: ssh (备份/管理容器) │ │
│ ├───────────────────────────────────────────────────────────────────┤ │
│ │ Image: k8smaster.qfusion.irds/irds/zookeeper-backup:v3.14.4 │ │
│ │ Image ID: sha256:47b43198... │ │
│ │ Pull Policy: IfNotPresent │ │
│ │ │ │
│ │ Resources: │ │
│ │ CPU Request: 250m │ │
│ │ CPU Limit: 500m │ │
│ │ Memory Request: 1Gi │ │
│ │ Memory Limit: 1Gi │ │
│ │ │ │
│ │ Security Context: │ │
│ │ Privileged: true │ │
│ │ Capabilities: │ │
│ │ - SYS_ADMIN │ │
│ │ - DAC_READ_SEARCH │ │
│ │ │ │
│ │ Volume Mounts: │ │
│ │ /conf - ConfigMap │ │
│ │ /data - PVC (共享主容器数据,用于备份) │ │
│ │ /mnt/log - EmptyDir │ │
│ │ /etc/localtime - HostPath │ │
│ └───────────────────────────────────────────────────────────────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────────┘
4.2 容器环境变量
zookeeper 容器环境变量
| 变量名 |
值 |
说明 |
| ZOO_ROLE |
participant |
节点角色 |
| ZOO_LOG_DIR |
/mnt/log |
日志目录 |
| ZOO_LOG4J_PROP |
LOG,CONSOLE,ROLLINGFILE |
日志输出格式 |
| ZK_SERVER_HEAP |
1536 |
服务端堆大小(MB) |
| SERVER_JVMFLAGS |
-Xms512m -XX:+AlwaysPreTouch -Djute.maxbuffer=8388608 -XX:MaxGCPauseMillis=50 |
JVM参数 |
| ENABLE_ACL |
false |
是否启用ACL |
| ZOO_SUPER_ID |
* (from secret) |
超级用户ID |
| ZOO_SUPER_DIGEST |
* (from secret) |
超级用户摘要 |
| CLIENT_JVMFLAGS |
* (empty) |
客户端JVM参数 |
| ZU_JVMFLAGS |
* (empty) |
额外JVM参数 |
5. 存储架构
5.1 持久化卷声明 (PVC)
| PVC 名称 |
容量 |
访问模式 |
存储类 |
绑定 PV |
绑定节点 |
| data-zookeeper-e93aadcf-0-0 |
20Gi |
RWO |
csi-localpv |
data-zookeeper-e93aadcf-0-0-qfusion1-f7aa7c0f |
qfusion1 |
| data-zookeeper-e93aadcf-1-0 |
20Gi |
RWO |
csi-localpv |
data-zookeeper-e93aadcf-1-0-qfusion4-fc233897 |
qfusion4 |
| data-zookeeper-e93aadcf-2-0 |
20Gi |
RWO |
csi-localpv |
data-zookeeper-e93aadcf-2-0-qfusion3-5ea93ba0 |
qfusion3 |
| data-zookeeper-e93aadcf-observer-0-0 |
20Gi |
RWO |
csi-localpv |
data-zookeeper-e93aadcf-observer-0-0-qfusion2-3b05d9e5 |
qfusion2 |
5.2 存储目录结构
/mnt/log/ # EmptyDir - 日志目录
├── server.log # ZooKeeper 服务日志
/data/ # PVC - 数据目录
├── version-2/ # 快照和事务日志目录
│ ├── snapshot.0 # 快照文件
│ └── log.0 # 事务日志
├── zoo.cfg.dynamic # 动态配置文件
└── myid # 节点ID文件
/conf/ # ConfigMap - 配置目录
├── zoo.cfg # 主配置文件
├── log4j.properties # 日志配置
├── jaas.config # SASL认证配置
└── env.sh # 环境变量脚本
5.3 存储策略
| 属性 |
值 |
| 存储类型 |
Persistence (持久化) |
| Reclaim Policy |
Delete (删除PVC时自动删除PV) |
| Volume Mode |
Filesystem |
| IOPS 配额 |
2000 |
6. 网络架构
6.1 服务列表
| 服务名称 |
类型 |
Cluster IP |
端口 |
选择器 |
| zookeeper-e93aadcf-client |
ClusterIP |
246.102.223.122 |
2181/TCP |
app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-admin-server |
ClusterIP |
246.99.195.124 |
8080/TCP |
app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-headless |
ClusterIP (None) |
None |
2181,2888,3888,7000,8080 |
app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-client-lb |
ClusterIP |
246.104.64.160 |
31144/TCP |
app=zookeeper-e93aadcf,kind=ZookeeperMember |
6.2 端口映射
| 内部端口 |
名称 |
协议 |
用途 |
| 2181 |
client |
TCP |
客户端连接端口,处理客户端请求 |
| 2888 |
quorum |
TCP |
集群成员间通信端口,用于数据同步 |
| 3888 |
leader-election |
TCP |
领导者选举端口 |
| 7000 |
metrics |
TCP |
Prometheus 指标导出端口 |
| 8080 |
admin-server |
TCP |
Jetty 管理服务器端口,暴露命令和监控信息 |
6.3 网络端点
内部客户端端点
zookeeper-e93aadcf-client.qfusion-admin.svc.cluster.local:2181
246.102.223.122:2181
Headless 服务端点 (直接访问各节点)
zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
zookeeper-e93aadcf-observer-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
6.4 Endpoints 详情
Client Service Endpoints
245.0.0.94:2181 (Replica-0)
245.0.3.243:2181 (Replica-1)
245.0.2.35:2181 (Replica-2)
245.0.1.95:2181 (Observer-0)
Admin Server Endpoints
245.0.0.94:8080 (Replica-0)
245.0.3.243:8080 (Replica-1)
245.0.2.35:8080 (Replica-2)
245.0.1.95:8080 (Observer-0)
Headless Service Endpoints
245.0.0.94:2181,2888,3888,7000,8080 (Replica-0)
245.0.3.243:2181,2888,3888,7000,8080 (Replica-1)
245.0.2.35:2181,2888,3888,7000,8080 (Replica-2)
245.0.1.95:2181,2888,3888,7000,8080 (Observer-0)
7. ZooKeeper 配置详解
7.1 主配置文件 (zoo.cfg)
# 基础配置
skipACL=yes # 跳过ACL检查
standaloneEnabled=false # 禁用单机模式
dataDir=/data # 数据目录
admin.serverPort=8080 # 管理服务器端口
# 时间配置 (单位: 毫秒)
tickTime=2000 # 心跳时间间隔 (2秒)
initLimit=10 # 初始连接超时 (10*tickTime=20秒)
syncLimit=2 # 同步超时 (2*tickTime=4秒)
# 性能配置
globalOutstandingLimit=1000 # 全局未完成请求限制
preAllocSize=65536 # 预分配磁盘空间大小 (64KB)
snapCount=10000 # 每N个事务写一次快照
commitLogCount=500 # 提交日志数量
snapSizeLimitInKb=4194304 # 快照大小限制 (4GB)
# 连接配置
maxCnxns=0 # 最大连接数 (0=无限制)
maxClientCnxns=60 # 每个客户端最大连接数
# 会话超时配置
minSessionTimeout=4000 # 最小会话超时 (4秒)
maxSessionTimeout=40000 # 最大会话超时 (40秒)
# 自动清理配置
autopurge.purgeInterval=1 # 自动清理间隔 (小时)
autopurge.snapRetainCount=3 # 保留的快照数量
# 网络配置
quorumListenOnAllIPs=false # 不监听所有IP
# 四字命令白名单
4lw.commands.whitelist=cons, envi, conf, crst, srvr, stat, mntr, ruok
# Prometheus 监控配置
metricsProvider.exportJvmInfo=true # 导出JVM信息
metricsProvider.httpPort=7000 # 指标端口
metricsProvider.className=org.apache.zookeeper.metrics.prometheus.PrometheusMetricsProvider
# 动态配置
reconfigEnabled=true # 启用动态重配置
dynamicConfigFile=/data/zoo.cfg.dynamic # 动态配置文件路径
7.2 环境配置 (env.sh)
#!/usr/bin/env bash
DOMAIN=zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local
QUORUM_PORT=2888
LEADER_PORT=3888
CLIENT_HOST=zookeeper-e93aadcf-client
CLIENT_PORT=2181
ADMIN_SERVER_HOST=zookeeper-e93aadcf-admin-server
ADMIN_SERVER_PORT=8080
CLUSTER_NAME=zookeeper-e93aadcf
CLUSTER_SIZE=3
7.3 日志配置 (log4j.properties)
# 日志根配置
zookeeper.root.logger=CONSOLE,ROLLINGFILE
zookeeper.console.threshold=INFO
zookeeper.log.dir=/mnt/log
zookeeper.log.file=server.log
zookeeper.log.threshold=INFO
# RollingFile 配置
log4j.appender.ROLLINGFILE.MaxFileSize=10MB # 单个日志文件最大10MB
log4j.appender.ROLLINGFILE.MaxBackupIndex=10 # 保留10个历史文件
7.4 SASL 认证配置 (jaas.config)
Server {
org.apache.zookeeper.server.auth.DigestLoginModule required
user_qfusion="mcd@loHmiDdoI0E9";
};
Client {
org.apache.zookeeper.server.auth.DigestLoginModule required
username="qfusion"
password="mcd@loHmiDdoI0E9";
};
8. 调度与亲和性
8.1 Pod 反亲和性规则
podAntiAffinity:
preferredDuringSchedulingIgnoredDuringExecution:
- weight: 20
podAffinityTerm:
labelSelector:
matchExpressions:
- key: app
operator: In
values:
- zookeeper-e93aadcf
topologyKey: kubernetes.io/hostname
说明: 尽量将 ZooKeeper Pod 调度到不同的节点上,权重为20(软约束)
8.2 节点分布
| Pod |
节点 |
Zone |
| zookeeper-e93aadcf-0-0 |
qfusion1 |
zone1 |
| zookeeper-e93aadcf-1-0 |
qfusion4 |
- |
| zookeeper-e93aadcf-2-0 |
qfusion3 |
- |
| zookeeper-e93aadcf-observer-0-0 |
qfusion2 |
- |
9. 安全与认证
9.1 ServiceAccount
| 属性 |
值 |
| 名称 |
dboperator |
| 命名空间 |
qfusion-admin |
| UID |
aa4358dc-1705-4f1a-aec2-935076cb84a2 |
9.2 Secret
| Secret 名称 |
类型 |
数据项 |
| zookeeper-e93aadcf-acl |
Opaque |
id, digest (超级用户认证信息) |
9.3 安全上下文
ssh 容器 (备份容器)
privileged: true
capabilities:
add:
- SYS_ADMIN
- DAC_READ_SEARCH
10. 监控与健康检查
10.1 健康检查配置
Liveness Probe (存活探针)
exec:
command: [zookeeperLive.sh]
initialDelaySeconds: 10
timeoutSeconds: 10
periodSeconds: 10
failureThreshold: 3
Readiness Probe (就绪探针)
exec:
command: [zookeeperReady.sh]
initialDelaySeconds: 10
timeoutSeconds: 10
periodSeconds: 10
failureThreshold: 3
successThreshold: 1
10.2 Prometheus 监控指标
| 指标类型 |
端口 |
路径 |
| JVM Metrics |
7000 |
/metrics |
| Admin Server |
8080 |
/commands |
10.3 四字命令
| 命令 |
描述 |
端口 |
| ruok |
检查服务是否运行 |
2181 |
| srvr |
列出服务器详细信息 |
2181 |
| stat |
输出服务器统计信息 |
2181 |
| mntr |
输出监控指标 |
2181 |
| cons |
列出所有连接的客户端 |
2181 |
| envi |
打印服务环境信息 |
2181 |
| conf |
打印配置信息 |
2181 |
| crst |
重置连接统计信息 |
2181 |
11. Operator 管理
11.1 ZooKeeper Operator
| 属性 |
值 |
| Pod 名称 |
zookeeper-operator-bcc44b994-wmqq6 |
| 命名空间 |
qfusion |
| 镜像 |
k8smaster.qfusion.irds/irds/zookeeper-operator:b4f4c95e… |
| 节点 |
qfusion3 |
| ServiceAccount |
zookeeper-operator |
| CPU Request |
10m |
| CPU Limit |
500m |
| Memory Request |
10Ki |
| Memory Limit |
500Mi |
11.2 Operator 启动参数
zookeeper-operator \
-metrics-bind-address=127.0.0.1:6000
11.3 Web 管理界面
| 属性 |
值 |
| 服务名称 |
zookeeper-webserver |
| 命名空间 |
qfusion |
| 类型 |
Deployment |
| 副本数 |
2 |
| Cluster IP |
246.98.8.26 |
| 端口 |
9090 |
12. 资源汇总
12.1 单 Pod 资源
| 资源类型 |
Request |
Limit |
| CPU |
750m |
1.5 cores |
| Memory |
3Gi |
3Gi |
| Storage |
20Gi |
20Gi |
12.2 集群总资源 (4 Pods)
| 资源类型 |
Request |
Limit |
| CPU |
3 cores |
6 cores |
| Memory |
12Gi |
12Gi |
| Storage |
80Gi |
80Gi |
12.3 Operator 资源
| 资源类型 |
Request |
Limit |
| CPU |
10m |
500m |
| Memory |
10Ki |
500Mi |
13. 自定义资源 (CRD)
13.1 ZookeeperCluster CR
apiVersion: zookeeper.pravega.io/v1beta1
kind: ZookeeperCluster
metadata:
name: zookeeper-e93aadcf
namespace: qfusion-admin
spec:
replicas: 3
observerReplicas: 1
maxUnavailableReplicas: 1
image:
repository: k8smaster.qfusion.irds/irds/zookeeper-3.7
tag: v1.1.0
pullPolicy: Always
storageType: Persistence
iopsQuota: 2000
persistence:
reclaimPolicy: Delete
spec:
storageClassName: csi-localpv
accessModes: [ReadWriteOnce]
resources:
requests:
storage: 20Gi
13.2 集群状态
status:
replicas: 4
readyReplicas: 4
internalClientEndpoint: 246.102.223.122:2181
externalClientEndpoint: N/A
members:
ready:
- zookeeper-e93aadcf-0-0
- zookeeper-e93aadcf-1-0
- zookeeper-e93aadcf-2-0
- zookeeper-e93aadcf-observer-0-0
conditions:
- type: PodsReady
status: "True"
- type: Error
status: "False"
- type: Upgrading
status: "False"
14. 标签 (Labels)
14.1 集群级别标签
| Key |
Value |
| AppName |
zookeeper-e93aadcf |
| CreatedBy |
woqutech.com |
| DBBranch |
zookeeperv3.7.1 |
| DBType |
zookeeper |
| ResourceUseType |
Database |
| TenantId |
847798ee3db44716b6357b04e5a55c16 |
| app |
zookeeper-e93aadcf |
| release |
zookeeper-e93aadcf |
14.2 Pod 额外标签
| Key |
Value |
范围 |
| Type |
Database |
所有 Pod |
| kind |
ZookeeperMember |
所有 Pod |
| replica |
0/1/2 |
Participant Pod |
| zk_role |
Participant/Observer |
所有 Pod |
| qfusion/zone |
zone1 |
Replica-0 |
15. Finalizers
| Finalizer |
说明 |
| cleanUpZookeeperPVC |
删除集群时自动清理PVC资源 |
16. 连接示例
16.1 客户端连接字符串
zookeeper-e93aadcf-client.qfusion-admin.svc.cluster.local:2181
zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181,\
zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181,\
zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
16.2 获取集群状态
export KUBECONFIG=/bpx/.145-admin.conf
kubectl get ZookeeperCluster zookeeper-e93aadcf -n qfusion-admin
kubectl get pods -n qfusion-admin -l app=zookeeper-e93aadcf
kubectl get svc -n qfusion-admin -l app=zookeeper-e93aadcf
kubectl get pvc -n qfusion-admin -l app=zookeeper-e93aadcf
kubectl exec -it -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper -- /bin/bash
17. 运维参考
17.1 常用命令
echo ruok | nc 245.0.0.94 2181
echo srvr | nc 245.0.0.94 2181
echo mntr | nc 245.0.0.94 2181
echo cons | nc 245.0.0.94 2181
curl http://245.0.0.94:7000/metrics
curl http://245.0.0.94:8080/commands
17.2 日志查看
kubectl logs -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper
kubectl logs -f -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper
kubectl logs -n qfusion-admin zookeeper-e93aadcf-0-0 -c ssh
17.3 故障排查
| 症状 |
可能原因 |
排查方法 |
| Pod 无法启动 |
资源不足、镜像拉取失败 |
kubectl describe pod |
| 连接超时 |
网络问题、防火墙 |
kubectl get svc, endpoints |
| 数据丢失 |
PVC 问题 |
kubectl get pvc, pv |
| 领导者选举失败 |
网络分区、节点故障 |
检查节点状态和日志 |
18. 附录
18.1 文档版本
| 版本 |
日期 |
说明 |
| 1.0 |
2025-12-24 |
初始版本 |
18.2 相关文档
本文档由 AI 自动生成,基于 Kubernetes 集群 145 的实时配置
所有评论(0)