当前位置：首页 > news >正文

Siamese网络实战：用Python手把手教你实现人脸相似度对比（附完整代码）

news 2026/3/26 6:49:29

Siamese网络实战：用Python手把手教你实现人脸相似度对比（附完整代码）

当我们需要判断两张人脸照片是否属于同一个人时，传统分类网络往往力不从心——尤其是当训练数据中缺乏目标人物样本时。Siamese网络（孪生神经网络）通过特征向量距离比较而非直接分类，完美解决了这一难题。本文将用PyTorch框架带你从零实现一个可商用的人脸相似度对比系统。

1. 环境配置与数据准备

1.1 基础环境搭建

推荐使用Anaconda创建隔离的Python环境：

conda create -n siamese python=3.8 conda activate siamese pip install torch==1.12.0 torchvision==0.13.0 pip install opencv-python matplotlib tqdm

1.2 人脸数据集处理

我们使用LFW（Labeled Faces in the Wild）数据集，包含13,000+人脸图像。关键预处理步骤：

人脸对齐：使用dlib检测68个关键点后对齐

import dlib detector = dlib.get_frontal_face_detector() predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")

数据增强策略：
- 随机水平翻转（p=0.5）
- 颜色抖动（亮度/对比度调整）
- 标准化：mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]

提示：正负样本比例建议控制在1:3，避免模型偏向输出"不相似"

2. 网络架构设计

2.1 特征提取主干网络

我们基于ResNet18修改最后一层：

from torchvision import models class FeatureExtractor(nn.Module): def __init__(self): super().__init__() resnet = models.resnet18(pretrained=True) self.features = nn.Sequential(*list(resnet.children())[:-1]) def forward(self, x): return self.features(x).flatten(1)

2.2 孪生网络结构

class SiameseNetwork(nn.Module): def __init__(self): super().__init__() self.encoder = FeatureExtractor() self.fc = nn.Sequential( nn.Linear(512, 256), nn.ReLU(), nn.Linear(256, 128) ) def forward(self, x1, x2): feat1 = self.fc(self.encoder(x1)) feat2 = self.fc(self.encoder(x2)) return feat1, feat2

2.3 对比损失函数实现

Contrastive Loss的PyTorch实现：

class ContrastiveLoss(nn.Module): def __init__(self, margin=2.0): super().__init__() self.margin = margin def forward(self, feat1, feat2, label): distance = F.pairwise_distance(feat1, feat2) loss = torch.mean( (1-label) * torch.pow(distance, 2) + label * torch.pow(torch.clamp(self.margin - distance, min=0.0), 2) ) return loss

3. 模型训练技巧

3.1 训练参数配置

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4) scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau( optimizer, mode='min', patience=3 )

3.2 关键训练指标监控

建议记录以下指标：

正样本对距离均值
负样本对距离均值
准确率（阈值设为margin/2）

3.3 困难样本挖掘

每3个epoch执行一次：

计算所有样本对距离
选择正样本中距离最大的top 20%
选择负样本中距离最小的top 20%

4. 实际应用优化

4.1 实时人脸对比方案

def compare_faces(img1_path, img2_path, threshold=1.0): img1 = preprocess(img1_path) img2 = preprocess(img2_path) with torch.no_grad(): feat1, feat2 = model(img1, img2) distance = F.pairwise_distance(feat1, feat2).item() return distance < threshold