当前位置：首页 > news >正文

YOLOv7姿态估计实战：从Labelme标注到训练数据准备的完整避坑指南（附代码）

news 2026/6/18 8:00:02

YOLOv7姿态估计实战：从Labelme标注到训练数据准备的完整避坑指南（附代码）

在计算机视觉项目中，数据准备环节往往是最容易被忽视却又最容易出问题的阶段。特别是当我们需要将Labelme标注的JSON文件转换为YOLO格式时，会遇到各种预料之外的错误和陷阱。本文将带你完整走一遍从Labelme标注到YOLOv7训练数据准备的全流程，重点解决实际项目中常见的FileExistsError等问题，并提供可直接运行的Python脚本。

1. 环境准备与项目结构规划

在开始数据转换前，合理的项目结构规划能避免80%的路径问题。建议采用以下目录结构：

project_root/ ├── datasets/ │ ├── labelme_annotations/ # 存放原始Labelme标注的JSON文件 │ │ ├── train/ │ │ ├── val/ │ │ └── test/ │ └── yolo_labels/ # 转换后的YOLO格式标签 │ ├── train/ │ ├── val/ │ └── test/ ├── images/ # 对应的图像文件 │ ├── train/ │ ├── val/ │ └── test/ └── scripts/ # 存放转换脚本

关键点注意事项：

确保所有路径中不包含中文或特殊字符
训练集、验证集和测试集的划分比例建议为7:2:1
图像和标注文件应当同名（仅扩展名不同）

2. Labelme标注格式解析

Labelme生成的JSON文件包含丰富的标注信息，我们需要理解其结构才能正确转换。一个典型的Labelme标注文件包含以下核心字段：

{ "version": "5.1.1", "flags": {}, "shapes": [ { "label": "grape", "points": [[x1, y1], [x2, y2]], "group_id": null, "shape_type": "rectangle", "flags": {} }, { "label": "picking", "points": [[x, y]], "group_id": null, "shape_type": "point", "flags": {} } ], "imagePath": "image.jpg", "imageData": null, "imageHeight": 1080, "imageWidth": 1920 }

对于姿态估计任务，我们通常会有两种类型的标注：

边界框（shape_type为"rectangle"）
关键点（shape_type为"point"）

3. YOLO格式转换实战

3.1 基础转换脚本

以下是一个完整的Labelme转YOLO格式的Python脚本，解决了常见的FileExistsError问题：

import os import json import shutil from tqdm import tqdm def convert_labelme_to_yolo(labelme_json_path, output_dir, bbox_classes, keypoint_classes): """ 将Labelme标注转换为YOLO格式 参数: labelme_json_path: Labelme JSON文件路径 output_dir: YOLO格式输出目录 bbox_classes: 边界框类别字典，如{'grape': 0} keypoint_classes: 关键点类别列表，如['picking'] """ # 创建输出目录（解决FileExistsError问题） os.makedirs(output_dir, exist_ok=True) with open(labelme_json_path, 'r', encoding='utf-8') as f: labelme_data = json.load(f) img_width = labelme_data['imageWidth'] img_height = labelme_data['imageHeight'] output_path = os.path.join(output_dir, os.path.splitext(os.path.basename(labelme_json_path))[0] + '.txt') with open(output_path, 'w', encoding='utf-8') as f: # 处理每个边界框 for shape in labelme_data['shapes']: if shape['shape_type'] == 'rectangle': # 边界框类别ID class_id = bbox_classes[shape['label']] # 边界框坐标处理 x1, y1 = shape['points'][0] x2, y2 = shape['points'][1] # 计算归一化中心坐标和宽高 x_center = ((x1 + x2) / 2) / img_width y_center = ((y1 + y2) / 2) / img_height width = abs(x2 - x1) / img_width height = abs(y2 - y1) / img_height # 写入边界框信息 f.write(f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}") # 收集该边界框内的关键点 keypoints_in_bbox = {} for kp_shape in labelme_data['shapes']: if kp_shape['shape_type'] == 'point': kp_x, kp_y = kp_shape['points'][0] if x1 <= kp_x <= x2 and y1 <= kp_y <= y2: keypoints_in_bbox[kp_shape['label']] = [kp_x, kp_y] # 按顺序写入关键点信息 for kp_class in keypoint_classes: if kp_class in keypoints_in_bbox: kp_x = keypoints_in_bbox[kp_class][0] / img_width kp_y = keypoints_in_bbox[kp_class][1] / img_height f.write(f" {kp_x:.6f} {kp_y:.6f} 2") # 2表示关键点可见 else: f.write(" 0 0 0") # 0表示关键点不存在 f.write("\n") def batch_convert_labelme_to_yolo(input_dir, output_root, bbox_classes, keypoint_classes, splits=['train', 'val', 'test']): """ 批量转换Labelme标注到YOLO格式 参数: input_dir: 包含Labelme标注的根目录 output_root: YOLO格式输出根目录 bbox_classes: 边界框类别字典 keypoint_classes: 关键点类别列表 splits: 数据集划分列表 """ for split in splits: input_split_dir = os.path.join(input_dir, split) output_split_dir = os.path.join(output_root, split) # 安全创建目录（避免FileExistsError） os.makedirs(output_split_dir, exist_ok=True) print(f"正在处理 {split} 集...") for filename in tqdm(os.listdir(input_split_dir)): if filename.endswith('.json'): json_path = os.path.join(input_split_dir, filename) convert_labelme_to_yolo(json_path, output_split_dir, bbox_classes, keypoint_classes) # 使用示例 if __name__ == '__main__': # 定义类别 bbox_classes = {'grape': 0} # 边界框类别到ID的映射 keypoint_classes = ['picking'] # 关键点类别列表 # 设置路径 labelme_root = 'datasets/labelme_annotations' yolo_labels_root = 'datasets/yolo_labels' # 执行批量转换 batch_convert_labelme_to_yolo(labelme_root, yolo_labels_root, bbox_classes, keypoint_classes)

3.2 关键改进点

目录创建安全性：
- 使用os.makedirs(output_dir, exist_ok=True)替代os.mkdir()
- exist_ok=True参数确保目录已存在时不会抛出FileExistsError
路径处理健壮性：
- 使用os.path模块处理路径，确保跨平台兼容性
- 正确处理文件名和扩展名
关键点与边界框关联：
- 自动将关键点关联到对应的边界框内
- 确保关键点顺序一致

4. 常见问题与解决方案

4.1 FileExistsError问题深度解析

FileExistsError: [WinError 183] 当文件已存在时，无法创建该文件错误通常发生在以下场景：

尝试创建已存在的目录
多线程/多进程环境下同时创建目录
路径权限问题

解决方案对比表：

方法	优点	缺点	适用场景
`os.mkdir()`	简单直接	存在时抛出异常	确保目录不存在的场景
`os.makedirs(exist_ok=True)`	自动处理已存在情况	可能掩盖真正的问题	大多数情况推荐使用
先检查后创建 (`if not os.path.exists`)	完全控制流程	存在竞态条件风险	不推荐在多线程中使用

4.2 Windows系统特有问题

在Windows平台上，还需要注意以下问题：

路径分隔符：
- 使用/或\\，避免单独使用\（转义字符问题）
- 推荐使用os.path.join()自动处理
长路径问题：
- Windows默认限制260字符路径长度
- 解决方案：
```
import os os.environ["PYTHONLEGACYWINDOWSFSENCODING"] = "1"
```
文件锁定问题：
- Windows对文件访问有严格锁定机制
- 确保及时关闭文件句柄

4.3 其他常见错误

UnicodeDecodeError：
- 解决方案：明确指定文件编码
```
with open(filepath, 'r', encoding='utf-8') as f:
```

JSON解码错误：

确保Labelme生成的JSON文件完整
添加错误处理：

try: data = json.load(f) except json.JSONDecodeError as e: print(f"Invalid JSON file: {filepath}") continue

5. 高级技巧与优化建议

5.1 并行处理加速

对于大规模数据集，可以使用多进程加速转换：

from multiprocessing import Pool def process_file(args): json_path, output_dir, bbox_classes, keypoint_classes = args try: convert_labelme_to_yolo(json_path, output_dir, bbox_classes, keypoint_classes) return True except Exception as e: print(f"Error processing {json_path}: {str(e)}") return False def parallel_convert(input_dir, output_dir, bbox_classes, keypoint_classes, workers=4): file_args = [] for filename in os.listdir(input_dir): if filename.endswith('.json'): json_path = os.path.join(input_dir, filename) file_args.append((json_path, output_dir, bbox_classes, keypoint_classes)) with Pool(workers) as p: results = p.map(process_file, file_args) success_rate = sum(results) / len(results) print(f"转换完成，成功率: {success_rate:.2%}")

5.2 数据验证

转换完成后，建议进行数据验证：

标注可视化检查：

import cv2 import numpy as np def visualize_yolo_label(image_path, label_path, class_names, keypoint_names): image = cv2.imread(image_path) img_h, img_w = image.shape[:2] with open(label_path, 'r') as f: lines = f.readlines() for line in lines: parts = line.strip().split() class_id = int(parts[0]) x_center, y_center, width, height = map(float, parts[1:5]) # 转换回像素坐标 x1 = int((x_center - width/2) * img_w) y1 = int((y_center - height/2) * img_h) x2 = int((x_center + width/2) * img_w) y2 = int((y_center + height/2) * img_h) # 绘制边界框 cv2.rectangle(image, (x1, y1), (x2, y2), (0, 255, 0), 2) cv2.putText(image, class_names[class_id], (x1, y1-10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0, 255, 0), 2) # 绘制关键点 for i in range(5, len(parts), 3): kp_x = float(parts[i]) * img_w kp_y = float(parts[i+1]) * img_h visibility = int(parts[i+2]) if visibility > 0: color = (0, 0, 255) if visibility == 2 else (0, 255, 255) cv2.circle(image, (int(kp_x), int(kp_y)), 5, color, -1) kp_name = keypoint_names[(i-5)//3] cv2.putText(image, kp_name, (int(kp_x)+10, int(kp_y)), cv2.FONT_HERSHEY_SIMPLEX, 0.5, color, 1) cv2.imshow('Annotation', image) cv2.waitKey(0) cv2.destroyAllWindows()

数据集完整性检查：
- 确保每个JSON文件都有对应的图像文件
- 检查标注是否包含所有必需的关键点

5.3 与YOLOv7训练配置集成

完成数据准备后，需要创建YOLOv7的配置文件：

data.yaml示例：

# 训练/验证/测试集的图像路径（相对于YOLOv7根目录） train: ../datasets/images/train val: ../datasets/images/val test: ../datasets/images/test # 类别数量 nc: 1 # 边界框类别数 nkpt: 1 # 每个边界框的关键点数 # 关键点名称和骨架连接（可选） kpt_names: ['picking'] skeleton: [] # 类别名称 names: ['grape']