当前位置：首页 > news >正文

保姆级教程：用Python脚本将OPIXray/HIXray安检X光数据集转成YOLO格式（附完整代码）

news 2026/7/23 7:27:07

从安检X光图像到YOLO模型：OPIXray/HIXray数据集格式转换实战指南

安检X光图像分析一直是计算机视觉领域的重要应用场景。无论是机场安检还是地铁安防，快速准确地识别危险物品都至关重要。OPIXray和HIXray作为两个公开的安检X光数据集，为研究者提供了宝贵的训练资源。然而，这两个数据集默认提供的VOC格式标注与当前流行的YOLO目标检测框架不兼容，这成为了许多初学者上手时的第一个障碍。

本文将带你一步步完成从VOC到YOLO格式的完整转换过程。不同于简单的代码展示，我们会深入解析每个关键步骤的设计原理，分享实际项目中积累的经验技巧，并提供完整的可运行代码。无论你是刚开始接触计算机视觉的学生，还是需要快速验证模型效果的研究者，这篇指南都能帮你节省大量摸索时间。

1. 理解数据集与格式差异

在开始转换之前，我们需要清楚地了解原始数据集的组成和目标格式的要求。OPIXray和HIXray虽然都是安检X光图像数据集，但它们在类别设置和标注方式上有所不同。

1.1 数据集概览

OPIXray数据集主要包含五类刀具物品：

Straight_Knife（直刀）
Folding_Knife（折叠刀）
Scissor（剪刀）
Utility_Knife（工具刀）
Multi-tool_Knife（多功能刀具）

HIXray数据集则聚焦于电子设备和日常物品：

Mobile_Phone（手机）
Laptop（笔记本电脑）
Portable_Charger（移动电源两种型号）
Tablet（平板电脑）
Cosmetic（化妆品）
Water（水瓶）
Nonmetallic_Lighter（非金属打火机）

1.2 VOC与YOLO格式对比

VOC格式使用XML文件存储标注信息，每个对象用绝对坐标表示：

<object> <name>Straight_Knife</name> <bndbox> <xmin>100</xmin> <ymin>50</ymin> <xmax>300</xmax> <ymax>200</ymax> </bndbox> </object>

而YOLO格式使用简单的文本文件，采用相对坐标：

0 0.5 0.4 0.3 0.2

其中各数字分别表示：类别索引、中心点x坐标、中心点y坐标、宽度、高度，所有值都是相对于图像宽高的比例。

2. 环境准备与代码结构

2.1 所需工具

确保你的开发环境已安装以下Python库：

pip install opencv-python numpy

2.2 项目目录结构

建议按如下方式组织文件：

/project_root /datasets /OPIXray /images /labels_voc /HIXray /images /labels_voc /converted_labels convert.py

3. 核心转换代码解析

以下是完整的格式转换脚本，我们将分段解析关键函数。

3.1 坐标转换逻辑

VOC到YOLO的核心是坐标系的转换。这个转换过程在voc_to_yolo函数中实现：

def voc_to_yolo(size, box): """将VOC格式的绝对坐标转换为YOLO格式的相对坐标 参数: size: 图像尺寸 (h, w, c) box: VOC格式边界框 [xmin, ymin, xmax, ymax] 返回: list: YOLO格式坐标 [x_center, y_center, width, height] """ dw = 1.0 / size[1] # 宽度比例因子 dh = 1.0 / size[0] # 高度比例因子 # 计算中心点坐标 x_center = (box[0] + box[2]) / 2.0 y_center = (box[1] + box[3]) / 2.0 # 计算宽度和高度 width = box[2] - box[0] height = box[3] - box[1] # 转换为相对坐标 return [ x_center * dw, y_center * dh, width * dw, height * dh ]

注意：YOLO格式要求坐标值在0-1之间，因此所有计算都需要乘以dw/dh进行归一化。

3.2 类别映射处理

由于两个数据集的类别不同，我们需要分别处理：

def get_class_index(dataset_type, class_name): """根据数据集类型和类别名称获取YOLO格式的类别索引 参数: dataset_type: 'OPIXray' 或 'HIXray' class_name: VOC格式的类别名称 返回: str: 类别索引 """ if dataset_type == 'OPIXray': class_dict = { 'Straight_Knife': '0', 'Folding_Knife': '1', 'Scissor': '2', 'Utility_Knife': '3', 'Multi-tool_Knife': '4' } else: # HIXray class_dict = { 'Mobile_Phone': '0', 'Laptop': '1', 'Portable_Charger_2': '2', 'Portable_Charger_1': '3', 'Tablet': '4', 'Cosmetic': '5', 'Water': '6', 'Nonmetallic_Lighter': '7' } return class_dict.get(class_name, None) # 返回None如果类别不存在

3.3 主流程控制

主函数负责协调整个转换流程：

def convert_voc_to_yolo(dataset_type, voc_label_dir, image_dir, output_dir): """主转换函数 参数: dataset_type: 数据集类型 ('OPIXray' 或 'HIXray') voc_label_dir: VOC格式标签目录 image_dir: 图像文件目录 output_dir: 输出目录 """ if not os.path.exists(output_dir): os.makedirs(output_dir) for label_file in os.listdir(voc_label_dir): if not label_file.endswith('.txt'): continue image_name = label_file.replace('.txt', '.jpg') image_path = os.path.join(image_dir, image_name) # 读取图像获取尺寸 try: image = cv2.imread(image_path) if image is None: print(f"警告: 无法读取图像 {image_path}") continue h, w = image.shape[:2] except Exception as e: print(f"处理图像 {image_path} 时出错: {str(e)}") continue # 处理每个标注 with open(os.path.join(voc_label_dir, label_file), 'r') as f: lines = f.readlines() with open(os.path.join(output_dir, label_file), 'w') as out_f: for line in lines: parts = line.strip().split() if len(parts) < 6: # 至少: img_name class x1 y1 x2 y2 continue class_name = parts[1] class_idx = get_class_index(dataset_type, class_name) if class_idx is None: print(f"警告: 未知类别 {class_name} 在文件 {label_file}") continue box = list(map(float, parts[2:6])) yolo_box = voc_to_yolo((h, w), box) # 写入YOLO格式 out_f.write(f"{class_idx} {' '.join(map(str, yolo_box))}\n")

4. 实际应用与问题排查

4.1 路径配置与运行

创建配置文件config.ini方便修改路径：

[PATHS] OPIXray_images = ./datasets/OPIXray/images OPIXray_labels_voc = ./datasets/OPIXray/labels_voc HIXray_images = ./datasets/HIXray/images HIXray_labels_voc = ./datasets/HIXray/labels_voc output_dir = ./converted_labels

主程序调整为读取配置：

import configparser config = configparser.ConfigParser() config.read('config.ini') # 转换OPIXray convert_voc_to_yolo( 'OPIXray', config['PATHS']['OPIXray_labels_voc'], config['PATHS']['OPIXray_images'], os.path.join(config['PATHS']['output_dir'], 'OPIXray') ) # 转换HIXray convert_voc_to_yolo( 'HIXray', config['PATHS']['HIXray_labels_voc'], config['PATHS']['HIXray_images'], os.path.join(config['PATHS']['output_dir'], 'HIXray') )

4.2 常见问题解决方案

问题1：标注��件与图像不匹配

提示：使用以下代码验证图像和标注的对应关系

def validate_dataset(image_dir, label_dir): """验证图像和标注文件是否匹配""" images = set(f.replace('.jpg', '') for f in os.listdir(image_dir) if f.endswith('.jpg')) labels = set(f.replace('.txt', '') for f in os.listdir(label_dir) if f.endswith('.txt')) print(f"独有图像: {images - labels}") print(f"独有标签: {labels - images}") print(f"共同文件: {len(images & labels)}对")

问题2：标注错误处理

对于OPIXray中可能存在的标注问题，建议可视化检查：

def visualize_yolo_labels(image_path, label_path): """可视化YOLO格式标注""" image = cv2.imread(image_path) h, w = image.shape[:2] with open(label_path, 'r') as f: for line in f: class_idx, x, y, bw, bh = map(float, line.split()) # 转换回绝对坐标 x_abs = int(x * w) y_abs = int(y * h) bw_abs = int(bw * w) bh_abs = int(bh * h) # 计算矩形坐标 x1 = x_abs - bw_abs // 2 y1 = y_abs - bh_abs // 2 x2 = x_abs + bw_abs // 2 y2 = y_abs + bh_abs // 2 # 绘制 cv2.rectangle(image, (x1, y1), (x2, y2), (0, 255, 0), 2) cv2.putText(image, str(int(class_idx)), (x1, y1-5), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 0, 255), 1) cv2.imshow('Annotation', image) cv2.waitKey(0) cv2.destroyAllWindows()

4.3 性能优化技巧

处理大型数据集时，可以考虑以下优化：

并行处理：使用Python的multiprocessing模块加速

from multiprocessing import Pool def process_file(args): """包装函数用于并行处理""" label_file, image_dir, output_dir, dataset_type = args # ... 处理逻辑 ... # 在主函数中 with Pool(processes=4) as pool: # 使用4个进程 args_list = [(f, image_dir, output_dir, dataset_type) for f in os.listdir(voc_label_dir) if f.endswith('.txt')] pool.map(process_file, args_list)

增量处理：记录已处理文件，支持断点续传

processed_file = os.path.join(output_dir, '.processed') processed = set() # 读取已处理文件 if os.path.exists(processed_file): with open(processed_file, 'r') as f: processed.update(f.read().splitlines()) # ... 处理过程中 ... if label_file not in processed: # 处理文件 with open(processed_file, 'a') as f: f.write(f"{label_file}\n")

5. 进阶应用与扩展

5.1 支持更多数据集格式

我们可以扩展代码以支持更多格式的转换。以下是一个工厂模式的实现：

class ConverterFactory: @staticmethod def get_converter(input_format, output_format): if input_format == 'VOC' and output_format == 'YOLO': return VOCtoYOLOConverter() elif input_format == 'COCO' and output_format == 'YOLO': return COCOtoYOLOConverter() # 添加更多转换器 else: raise ValueError(f"不支持的转换: {input_format}到{output_format}") class VOCtoYOLOConverter: def convert(self, annotation, image_size): # VOC到YOLO的转换逻辑 pass class COCOtoYOLOConverter: def convert(self, annotation, image_size): # COCO到YOLO的转换逻辑 pass

5.2 数据集统计分析

转换完成后，进行数据分析有助于理解数据集特征：

def analyze_dataset(label_dir, class_names): """分析YOLO格式数据集的分布""" class_counts = {name: 0 for name in class_names} bbox_sizes = [] for label_file in os.listdir(label_dir): with open(os.path.join(label_dir, label_file), 'r') as f: for line in f: parts = line.split() if len(parts) != 5: continue class_idx = int(parts[0]) class_counts[class_names[class_idx]] += 1 # 记录边界框大小 _, _, bw, bh = map(float, parts[1:]) bbox_sizes.append((bw, bh)) print("类别分布:") for name, count in class_counts.items(): print(f"{name}: {count}") # 计算平均边界框大小 if bbox_sizes: avg_w = sum(w for w, h in bbox_sizes) / len(bbox_sizes) avg_h = sum(h for w, h in bbox_sizes) / len(bbox_sizes) print(f"平均边界框大小: {avg_w:.3f}x{avg_h:.3f} (相对图像尺寸)")

5.3 与训练流程集成

将转换脚本集成到YOLO训练流程中，创建完整的data.yaml配置文件：

def create_yolo_config(dataset_name, class_names, train_dir, val_dir, test_dir=None): """生成YOLO训练所需的data.yaml文件""" config = { 'train': train_dir, 'val': val_dir, 'nc': len(class_names), 'names': class_names } if test_dir: config['test'] = test_dir with open(f'{dataset_name}.yaml', 'w') as f: yaml.dump(config, f, default_flow_style=False)

在实际项目中，我发现将数据转换、分析和训练流程自动化可以显著提高效率。例如，可以创建一个完整的pipeline脚本，从原始数据下载到最终模型训练一气呵成。对于OPIXray和HIXray这样的标准数据集，建议将转换后的YOLO格式数据也分享给团队成员，避免重复工作。

查看全文

http://www.jsqmd.com/news/927273/