当前位置：首页 > news >正文

告别Labelme标注烦恼：手把手教你用Python脚本将Labelme数据一键转成YOLACT可用的COCO格式

news 2026/6/15 1:05:51

从Labelme到YOLACT：零基础实现自定义数据集格式转换全攻略

在计算机视觉领域，数据标注是模型训练前的关键准备工作，而格式转换往往是让开发者头疼的"最后一公里"。许多团队花费大量时间标注数据后，却发现模型要求的输入格式与现有标注格式不匹配，导致项目进度受阻。本文将彻底解决这一痛点，带你从Labelme标注文件出发，一步步生成符合YOLACT模型要求的COCO格式数据集。

1. 理解数据格式差异与转换原理

Labelme和COCO是两种截然不同的标注格式，理解它们的结构差异是成功转换的前提。

Labelme采用每个图像对应一个JSON文件的存储方式，文件中包含图像基本信息以及多边形标注点坐标。而COCO格式则将整个数据集的信息整合在一个JSON文件中，包含以下核心字段：

images: 所有图像的基本信息列表
annotations: 所有标注对象的详细信息列表
categories: 数据集中包含的类别列表

关键转换逻辑：

遍历所有Labelme JSON文件，提取图像信息存入images数组
解析每个JSON中的shapes数组，将每个多边形转换为COCO的annotation格式
建立类别名称到ID的映射关系，填充categories数组

注意：COCO格式要求每个标注对象必须有area(面积)和bbox(边界框)字段，这些需要从Labelme的多边形点坐标计算得出

2. 环境准备与项目结构搭建

在开始编写转换脚本前，需要确保开发环境配置正确。以下是推荐的环境配置：

# 创建Python虚拟环境 python -m venv labelme2coco source labelme2coco/bin/activate # Linux/Mac # labelme2coco\Scripts\activate # Windows # 安装必要依赖 pip install labelme numpy scikit-learn pillow

建议的项目目录结构如下：

/yolact_project │── /labelme_data │ ├── image1.jpg │ ├── image1.json │ ├── image2.jpg │ ├── image2.json │ └── ... ├── /coco │ ├── /annotations │ └── /images │ ├── /train2017 │ └── /val2017 └── labelme2coco.py

3. 核心转换脚本详解

下面我们分模块解析转换脚本的关键代码实现。完整脚本将处理以下核心功能：

数据集划分（训练集/验证集）
Labelme到COCO的格式映射
必要的几何计算（边界框、面积等）
文件复制与目录组织

3.1 类别定义与初始化

import os import json import numpy as np from sklearn.model_selection import train_test_split # 类别映射配置（根据实际标注修改） CLASSNAME_TO_ID = { "trafficcone": 1, "person": 2, "car": 3 # 添加更多类别... } class Labelme2CoCo: def __init__(self): self.images = [] self.annotations = [] self.categories = [] self.img_id = 0 self.ann_id = 0 def _init_categories(self): """初始化COCO的categories字段""" for name, id_ in CLASSNAME_TO_ID.items(): self.categories.append({ "id": id_, "name": name, "supercategory": "" # 可选字段 })

3.2 图像信息提取

def _image(self, labelme_obj, json_path): """构建COCO的image字段""" img_file = os.path.basename(json_path).replace(".json", ".jpg") return { "id": self.img_id, "file_name": img_file, "height": labelme_obj["imageHeight"], # 直接从Labelme读取 "width": labelme_obj["imageWidth"], "license": 1, # 可选字段 "date_captured": "" # 可选字段 }

3.3 标注信息转换

def _annotation(self, shape, image_id): """将Labelme的shape转换为COCO的annotation""" points = np.array(shape["points"]) segmentation = points.flatten().tolist() # 计算边界框 [x,y,width,height] min_x, min_y = np.min(points, axis=0) max_x, max_y = np.max(points, axis=0) bbox = [min_x, min_y, max_x - min_x, max_y - min_y] # 计算多边形面积 area = self._calculate_polygon_area(points) return { "id": self.ann_id, "image_id": image_id, "category_id": CLASSNAME_TO_ID[shape["label"]], "segmentation": [segmentation], "area": area, "bbox": bbox, "iscrowd": 0 # 0表示单个对象 } def _calculate_polygon_area(self, points): """使用Shoelace公式计算多边形面积""" x = points[:, 0] y = points[:, 1] return 0.5 * np.abs(np.dot(x, np.roll(y, 1)) - np.dot(y, np.roll(x, 1)))

4. 完整流程与错误排查

4.1 主流程实现

def main(): # 1. 准备目录结构 os.makedirs("coco/annotations", exist_ok=True) os.makedirs("coco/images/train2017", exist_ok=True) os.makedirs("coco/images/val2017", exist_ok=True) # 2. 获取所有Labelme JSON文件 json_files = [f for f in os.listdir("labelme_data") if f.endswith(".json")] json_paths = [os.path.join("labelme_data", f) for f in json_files] # 3. 划分训练集和验证集（80%训练，20%验证） train_files, val_files = train_test_split(json_paths, test_size=0.2, random_state=42) # 4. 转换训练集 converter = Labelme2CoCo() train_coco = converter.convert(train_files) with open("coco/annotations/instances_train2017.json", "w") as f: json.dump(train_coco, f, indent=2) # 5. 转换验证集 converter = Labelme2CoCo() val_coco = converter.convert(val_files) with open("coco/annotations/instances_val2017.json", "w") as f: json.dump(val_coco, f, indent=2) # 6. 复制图像文件到对应目录 for json_file in train_files: img_file = json_file.replace(".json", ".jpg") shutil.copy(img_file, "coco/images/train2017/") for json_file in val_files: img_file = json_file.replace(".json", ".jpg") shutil.copy(img_file, "coco/images/val2017/") if __name__ == "__main__": main()

4.2 常见问题与解决方案

问题1：类别ID映射错误

症状：转换后的JSON中某些类别的ID与预期不符解决方案：

检查CLASSNAME_TO_ID字典是否包含所有标注类别
确保Labelme标注时使用的类别名称与字典中的key完全一致（大小写敏感）

问题2：路径错误导致文件找不到

症状：脚本运行时抛出FileNotFoundError解决方案：

确认labelme_data目录存在且包含.jpg和.json文件
检查文件命名是否一致（如image1.jpg对应image1.json）
使用绝对路径而非相对路径

问题3：生成的COCO JSON验证失败

症状：YOLACT训练时提示标注文件格式错误解决方案：

使用在线COCO格式验证工具检查文件
确保所有必需字段都存在（特别是area和bbox）
检查多边形点坐标是否超出图像边界

5. 进阶技巧与性能优化

当处理大规模数据集时，原始脚本可能需要优化。以下是几个实用技巧：

批量处理加速：

from multiprocessing import Pool def process_file(json_path): # 单个文件的处理逻辑 pass with Pool(processes=4) as pool: # 使用4个进程 pool.map(process_file, json_paths)

内存优化：对于超大数据集，可以分批处理并保存：

CHUNK_SIZE = 1000 # 每1000个文件保存一次 for i in range(0, len(json_paths), CHUNK_SIZE): chunk = json_paths[i:i+CHUNK_SIZE] coco_data = converter.convert(chunk) with open(f"coco/annotations/instances_train2017_{i}.json", "w") as f: json.dump(coco_data, f)

可视化验证：转换后可使用以下代码抽样检查标注是否正确：

from PIL import Image, ImageDraw def visualize_annotation(img_path, ann): img = Image.open(img_path) draw = ImageDraw.Draw(img) # 绘制边界框 bbox = ann["bbox"] draw.rectangle([bbox[0], bbox[1], bbox[0]+bbox[2], bbox[1]+bbox[3]], outline="red", width=2) # 绘制多边形 seg = ann["segmentation"][0] points = [(seg[i], seg[i+1]) for i in range(0, len(seg), 2)] draw.polygon(points, outline="green") return img

在实际项目中，我们还需要考虑标注质量对模型性能的影响。建议转换完成后进行以下检查：