当前位置：首页 > news >正文

别再只懂Deployment了！用K8S探针（Liveness/Readiness/Startup）和优雅停机，给你的Spring Boot应用上双保险

news 2026/6/10 22:12:18

Kubernetes探针与Spring Boot优雅停机的深度协同实践

在云原生时代，如何确保应用在Kubernetes环境中的高可用性成为开发者必须掌握的技能。本文将深入探讨Kubernetes探针机制与Spring Boot优雅停机的协同工作原理，帮助开发者构建真正"无感知"的云原生应用。

1. Kubernetes探针机制解析

Kubernetes提供了三种探针来监控和管理容器生命周期，每种探针都有其特定的应用场景和工作原理。

1.1 Liveness探针：应用健康守护者

LivenessProbe用于检测容器是否处于运行状态。当探针失败时，kubelet会终止容器并根据重启策略决定后续操作。对于Spring Boot应用，典型的Liveness检查配置如下：

livenessProbe: httpGet: path: /actuator/health/liveness port: 8080 initialDelaySeconds: 30 periodSeconds: 10

关键参数说明：

initialDelaySeconds：容器启动后等待多少秒开始第一次探测
periodSeconds：探测间隔时间
timeoutSeconds：探测超时时间
failureThreshold：连续失败多少次才判定为不健康

1.2 Readiness探针：流量控制阀门

ReadinessProbe决定容器是否准备好接收流量。与Liveness不同，Readiness失败不会导致容器重启，而是从Service的Endpoint中移除该Pod。Spring Boot的典型配置：

readinessProbe: httpGet: path: /actuator/health/readiness port: 8080 initialDelaySeconds: 30 periodSeconds: 5

实际案例：某电商平台在促销活动期间，由于未配置Readiness探针，新部署的Pod在数据库连接未完全建立时就接收流量，导致大量504错误。配置Readiness探针后，只有当所有依赖服务连接就绪后Pod才会接收流量。

1.3 Startup探针：慢启动应用救星

StartupProbe专为启动缓慢的应用设计，它会暂时禁用其他探针直到应用完成启动。这对于需要长时间初始化的Spring Boot应用特别有用：

startupProbe: httpGet: path: /actuator/health/startup port: 8080 failureThreshold: 30 periodSeconds: 10

三种探针对比表：

探针类型	检测失败后果	适用场景	默认状态
Liveness	重启容器	检测死锁、无限循环	Success
Readiness	移除Endpoint	检测依赖服务可用性	Success
Startup	重启容器	长时间启动的应用	Success

2. Spring Boot与Kubernetes探针的集成

Spring Boot从2.3版本开始提供了对Kubernetes探针的原生支持，通过Actuator端点实现无缝集成。

2.1 Actuator健康端点配置

首先需要在pom.xml中添加依赖：

<dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency>

然后在application.properties中启用相关端点：

management.endpoint.health.probes.enabled=true management.endpoints.web.exposure.include=health management.endpoint.health.show-details=always

2.2 自定义健康指标

Spring Boot允许开发者自定义健康指标来更精确地反映应用状态：

@Component public class CustomHealthIndicator implements HealthIndicator { @Override public Health health() { // 检查外部系统连接状态 boolean externalSystemOk = checkExternalSystem(); return externalSystemOk ? Health.up().build() : Health.down().withDetail("reason", "External system unavailable").build(); } }

2.3 探针端点的高级配置

对于复杂场景，可以单独配置各个探针端点：

management.endpoint.health.group.liveness.include=livenessState,customCheck management.endpoint.health.group.readiness.include=readinessState,db,redis

3. 优雅停机与Kubernetes的协同

优雅停机(Graceful Shutdown)是确保应用平滑下线的关键机制，需要与Kubernetes的生命周期管理协同工作。

3.1 Spring Boot优雅停机配置

Spring Boot 2.3+内置了优雅停机支持，只需简单配置：

server.shutdown=graceful spring.lifecycle.timeout-per-shutdown-phase=30s

工作原理：

收到SIGTERM信号后停止接收新请求
等待正在处理的请求完成
关闭应用上下文

3.2 Kubernetes生命周期钩子

Kubernetes提供了preStop钩子，可以与Spring Boot的优雅停机机制配合使用：

lifecycle: preStop: exec: command: ["sh", "-c", "sleep 10"]

经验分享：在实际项目中，我们发现仅依赖Spring Boot的优雅停机可能不够，因为Kubernetes组件更新Endpoint需要时间。最佳实践是结合preStop钩子，给予足够的缓冲时间。

3.3 terminationGracePeriodSeconds优化

这个参数决定了Kubernetes等待应用优雅退出的最长时间：

spec: terminationGracePeriodSeconds: 60

配置建议：

通常设置为略大于应用最长请求处理时间
考虑集群规模和Endpoint传播延迟
监控实际停机时间进行调整

4. 滚动更新策略与探针的协同

合理的滚动更新策略配合探针配置可以实现真正的零停机部署。

4.1 Deployment滚动更新配置

strategy: type: RollingUpdate rollingUpdate: maxSurge: 25% maxUnavailable: 25%

参数说明：

maxSurge：更新过程中可以超过期望Pod数量的最大值
maxUnavailable：更新过程中不可用Pod的最大数量

4.2 就绪探针与滚动更新

就绪探针直接影响滚动更新的效果。不合理的配置可能导致：

新Pod未就绪就接收流量
旧Pod过早终止导致请求中断
更新过程卡住

故障排查案例：某次部署后发现滚动更新卡在50%，经查是Readiness探针的successThreshold设置过大，导致Kubernetes认为新Pod一直未就绪。

4.3 最佳实践配置示例

apiVersion: apps/v1 kind: Deployment metadata: name: spring-boot-app spec: replicas: 3 strategy: type: RollingUpdate rollingUpdate: maxSurge: 1 maxUnavailable: 0 template: spec: containers: - name: app image: my-spring-boot-app:latest ports: - containerPort: 8080 livenessProbe: httpGet: path: /actuator/health/liveness port: 8080 initialDelaySeconds: 30 periodSeconds: 10 readinessProbe: httpGet: path: /actuator/health/readiness port: 8080 initialDelaySeconds: 30 periodSeconds: 5 failureThreshold: 3 startupProbe: httpGet: path: /actuator/health/startup port: 8080 failureThreshold: 30 periodSeconds: 10 lifecycle: preStop: exec: command: ["sh", "-c", "sleep 15"] terminationGracePeriodSeconds: 45