当前位置: 首页 > news >正文

137. 集群或节点配置卡在节点污染“node.cloudprovider.kubernetes.io/uninitialized”

During the provisioning of RKE2 clusters, the machines are stuck with the status 'waiting for cluster agent'. The rke2-server service is running and pods are being created, but a number of them are in a pending state due to scheduling errors.
在配置 RKE2 集群时,机器会处于“等待集群代理”状态。rke2 服务器服务正在运行,pod 正在创建,但由于调度错误,其中不少 Pod 处于待处理状态。

Example: The vSphere CPI (Cloud Provider Interface) is unable to locate the virtual machine in vSphere, which results in the node being uninitialised. In the downstream cluster, the cloud controller manager pod logs indicate this error locating the virtual machine:
示例:vSphere CPI(云服务提供商接口)无法在 vSphere 中定位虚拟机,导致节点未初始化。在下游集群中,云控制器管理器的 Pod 日志显示了该虚拟机定位的错误:

search.go:186] Did not find node node1.example.com in vc=example.com and datacenter=datacentre1 nodemanager.go:160] WhichVCandDCByNodeID failed using VM name. Err: No VM found nodemanager.go:205] shakeOutNodeIDLookup failed. Err=No VM found node_controller.go:233] error syncing 'node1.example.com: failed to get instance metadata for node node1.example.com: failed to get instance ID from cloud provider: No VM found, requeuing node_controller.go:244] "Unhandled Error" err="error syncing 'node1.example.com': failed to get instance metadata for node node1.example.com: failed to get instance ID from cloud provider: No VM found, requeuing" node_controller.go:271] Update 1 nodes status took 57.912µs.
Resolution 结局

In order to resolve this issue, validate and correct the Cloud Provider configuration for the affected cluster, as required.
为解决此问题,请根据需要验证并纠正受影响集群的云提供商配置。

In the example above, with the vSphere Cloud Provider, you would need to check the Add-on: vSphere CPI configuration for the cluster, to ensure the correct vCenter and Data Center was configured, as well as validating that VMware Tools was running successfully in the virtual machine, and its hostname was correctly configured.
在上述示例中,使用 vSphere Cloud Provider 时,你需要检查集群的 Add-on: vSphere CPI 配置,以确保正确的 vCenter 和 Data Center 配置正确,同时验证 VMware Tools 在虚拟机中是否成功运行,主机名配置正确。

Cause 病因

The node.cloudprovider.kubernetes.io/uninitialized taint is added to new nodes in clusters where a Cloud Provider is configured. This taint is removed by the CPI once it successfully queries and sets the spec.providerID on the node. If there is a problem with the CPI configuration and this cannot be successfully queried, then the node will remain in this state and fail to complete provisioning. If this is the first node in the cluster then the cluster itself will be stuck in provisioning.
node.cloudprovider.kubernetes.io/uninitialized 污染会被添加到配置云服务提供商的集群中新节点。一旦 CPI 成功查询并设置了节点上的 spec.providerID,这个污点就会被清除。如果 CPI 配置存在问题且无法成功查询,节点将保持该状态,无法完成配置。如果这是集群中的第一个节点,那么集群本身将被卡在配置中。

Additional Information 附加信息
Environment 环境
  • A Rancher-provisioned RKE2 cluster with a Cloud Provider configured
    一个由 Rancher 配置的 RKE2 集群,配置了云提供商

访问Rancher-K8S解决方案博主,企业合作伙伴 :
https://blog.csdn.net/lidw2009

http://www.jsqmd.com/news/680327/

相关文章:

  • 从‘删库到跑路’梗说起:聊聊rm -rf的设计哲学与Windows命令的替代方案
  • 2026车辆轮船幕墙防火阻燃密封条实力厂家推荐:车辆轮船设备密封、条幕墙密封、防火阻燃密封条 - 栗子测评
  • 云微海外短剧系统多少钱?多语言多支付搭建包上架
  • 139. 由于卸载Rancher主目录,恢复失败
  • 51单片机新手必看:Proteus里让LM016L液晶屏显示字符的保姆级教程(附完整代码)
  • 2026年质量好的阳台门窗/推拉门窗品牌厂家推荐 - 行业平台推荐
  • 保姆级教程:在Deepin/UOS上手动打包最新版QQ的deb安装包(附字体修复方案)
  • 弱口令漏洞挖掘的30个实战技巧!
  • 140. 如何使用 nginx /dbg
  • 2026年热门的推拉门窗/大连阳光房门窗/卧室门窗厂家综合对比分析 - 品牌宣传支持者
  • 华为eNSP模拟企业网:三层交换机DHCP配置保姆级教程(含VLAN规划与排错)
  • 2026影视拍摄模特选购指南:宣传单派发、小丑魔术师、展会充场、展会兼职、展会模特礼仪、展会派发传单、展会礼仪模特选择指南 - 优质品牌商家
  • 告别付费数据源!用Python+Baostock+MySQL搭建你的免费股票数据本地库(保姆级教程)
  • 智能车竞赛节能信标改造:用ITR9909+BC517达林顿管替换霍尔传感器(附完整电路图)
  • 告别手动抄数据:5分钟学会用WebPlotDigitizer智能提取图表数据
  • 360挖出微软8年致命漏洞,10亿用户安危被改写
  • 为什么头部云厂商已强制要求Docker 27低代码标准?揭秘CNCF最新容器化准入白皮书核心条款
  • XGBClassifier默认参数里藏了哪些坑?新手必看的6个实战避雷指南
  • 141. PLEG is not healthy 问题
  • MCNP新手避坑指南:用Fmesh卡计算钴-60源剂量当量,从几何建模到结果可视化的完整流程
  • 软件测试工程师的35岁破局之道:构建技术与管理双轨制晋升体系
  • 别再死记硬背AHP公式了!用Excel+Python 5分钟搞定数学建模里的权重计算
  • 手机号逆向查询QQ号:3步高效解决方案完整指南
  • 从“完美”执念到“价值”觉醒:一位测试工程师的3000用例优化心路
  • 逆向思维学PWN:通过GDB调试实战,动态理解寄存器与栈的变化
  • Docker 27网络策略升级全貌(隔离能力提升270%?内测数据首次公开)
  • 143. Rancher 支持 Docker 吗?
  • 告别U-Net独大?手把手带你用PyTorch和MONAI复现UNETR,搞定3D医学图像分割
  • GLM-TTS流式推理体验:低延迟实时语音合成,打造交互式应用
  • 避开这些坑!ESP32-CAM驱动舵机时GPIO冲突与电源问题的实战解决