当前位置: 首页 > news >正文

【CANN训练营】体验基于Caffe ResNet-50网络实现图片分类实践操作

实例功能

很简单的一个实例,功能就是一个实现图片分类的功能,然后拓展实现以下

  • 将一张YUV420SP格式的图片编码为*.jpg格式的图片。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,缩放,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图贴图,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将YUV420SP NV12格式的图片(分辨率8192*8192)缩放,得到4000*4000。

环境及环境版本介绍

NPU:Ascend910(32GB)

CANN版本:CANN-8.0.RC3.alpha001

开始实践

创建conda环境

conda create -n cann_demopython=3.8-y conda activate cann_demo

安装CANN

wgethttps://ascend-repo.obs.cn-east-2.myhuaweicloud.com/Milan-ASL/Milan-ASL%20V100R001C20SPC703/Ascend-cann-toolkit_8.0.0.alpha003_linux-aarch64.runbashAscend-cann-toolkit_8.0.0.alpha003_linux-aarch64.run --full

激活环境变量

source/home/ma-user/Ascend/ascend-toolkit/set_env.sh

下载体验代码仓

gitclone -b v0.3-8.0.0.alpha003 https://gitee.com/Ascend/samples

进入示例文件夹

cdsamples/cplusplus/level2_simple_inference/1_classification/vpc_jpeg_resnet50_imagenet_classification

获取ResNet-50原始模型

下载模型CAFFE文件

cdcaffe_modelwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/003_Atc_Models/AE/ATC%20Model/resnet50/resnet50.caffemodelwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/003_Atc_Models/AE/ATC%20Model/resnet50/resnet50.prototxtcd..

安装依赖

pipinstalldecorator attrs psutil sympy scipy

转换模型

atc --model=caffe_model/resnet50.prototxt --weight=caffe_model/resnet50.caffemodel --framework=0--soc_version=Ascend910 --insert_op_conf=caffe_model/aipp.cfg --output=model/resnet50_aipp

准备测试图片

cddatawgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/dvpp_vpc_8192x8192_nv12.yuvwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/persian_cat_1024_1536_283.jpgwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/wood_rabbit_1024_1061_330.jpgwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/wood_rabbit_1024_1068_nv12.yuvcd..

编译运行

安装依赖

condainstall-c conda-forge cmake condainstall-c conda-forge binutils

创建目录

mkdir-p build/intermediates/host

设置环境变量

source/home/ma-user/Ascend/ascend-toolkit/set_env.shexportDDK_PATH=$HOME/Ascend/ascend-toolkit/latestexportNPU_HOST_LIB=$DDK_PATH/runtime/lib64/stub

生成编译文件

cdbuild/intermediates/host cmake../../../src -DCMAKE_CXX_COMPILER=g++ -DCMAKE_SKIP_RPATH=TRUEmake

运行

设置main文件权限为可运行

cd ../../../out chmod +x main

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,缩放,再进行模型推理,分别得到两张图片的推理结果

./main0

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] make directory successfully. [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcResize [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.969727], top5[0.979855] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcResize [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[331], top1[0.895508], top5[1.000134] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图,再进行模型推理,分别得到两张图片的推理结果。

./main1

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcCrop [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.996094], top5[0.999629] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcCrop [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[330], top1[0.859863], top5[1.000106] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图贴图,再进行模型推理,分别得到两张图片的推理结果。

./main2

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcCropAndPaste [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.431885], top5[0.751892] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcCropAndPaste [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[330], top1[0.685059], top5[0.969410] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将一张YUV420SP格式的图片编码为*.jpg格式的图片。

./main3

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] start to jpege picture ../data/wood_rabbit_1024_1068_nv12.yuv [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将一张分辨率为8192*8192的YUV420SP格式的图片缩放至4000*4000。

./main4

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp process 8k resize begin [INFO] dvpp init resource success [INFO] dvpp process 8k resize success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

整体运行结果

执行可执行文件成功后,同时会在main文件同级的result目录下生成结果文件,便于后期查看。结果文件如下:

  • dvpp_output_0:persian_cat_1024_1536_283.jpg:图片经过缩放或抠图或抠图贴图之后的结果图片。
  • dvpp_output_1:wood_rabbit_1024_1061_330.jpg:图片经过缩放或抠图或抠图贴图之后的结果图片。
  • model_output_0:persian_cat_1024_1536_283.jpg:图片的模型推理结果,二进制文件。
  • model_output_0.txt:persian_cat_1024_1536_283.jpg:图片的模型推理结果,txt文件。
  • model_output_1:wood_rabbit_1024_1061_330.jpg:图片的模型推理结果,二进制文件。
  • model_output_1.txt:wood_rabbit_1024_1061_330.jpg:图片的模型推理结果,txt文件。
  • jpege_output_0.jpg:wood_rabbit_1024_1068_nv12.yuv:图片结果编码后的结果图片。
  • dvpp_vpc_4000x4000_nv12.yuv:dvpp_vpc_8192x8192_nv12.yuv:图片缩放后的结果图片。
http://www.jsqmd.com/news/210484/

相关文章:

  • 智能教学助手:快速搭建教具识别课堂应用
  • 计算机视觉运维监控:模型服务健康度指标体系建设
  • 企业级PPK系统登录页面开发实战
  • 万物识别模型推理速度优化技巧:提升响应效率的方法
  • 宠物成长记录:按时间轴整理毛孩各阶段影像
  • AI艺术创作:结合万物识别技术的创意图像生成方案
  • 垃圾分类指导APP:拍照识别垃圾类型并提示投放方式
  • 码市VS传统开发:效率提升10倍的秘密
  • 虚拟试衣间技术:人体轮廓识别与服装贴合渲染
  • 中文优先设计!Hunyuan-MT-7B对中文语义理解更深刻
  • 盲文转换辅助:图像转语音描述系统构建
  • PyTorch与AI:如何用智能工具加速深度学习开发
  • XSS跨站脚本攻击防范:Hunyuan-MT-7B前端过滤机制
  • DDoS攻击防御策略在Hunyuan-MT-7B服务中的实施
  • 智能制造场景:MGeo统一全球工厂地址便于供应链协同
  • 乡村教师借助Hunyuan-MT-7B为学生提供双语教学资源
  • 低成本实验方案:用云端GPU按需运行中文物体识别模型
  • 游戏开发辅助:NPC行为模式识别算法参考实现
  • 模型解释性研究:中文物体识别系统的可视化分析
  • Adobe软件未授权警告:新手必读指南
  • Hunyuan-MT-7B-WEBUI和Typora官网一样好用?用户体验深度评测
  • 人力资源系统优化:MGeo校验员工住址真实性
  • Docker部署Hunyuan-MT-7B,实现环境隔离与快速迁移
  • 机器人避障策略优化:动态环境目标识别
  • 模型解释性研究:中文物体识别系统的可视化分析与调试
  • AI辅助设计:快速搭建中文素材识别系统
  • 成本对比:自建GPU服务器 vs 云端预置环境
  • JavaScript调用Hunyuan-MT-7B API?前端如何对接翻译接口
  • 【MCP Azure Stack HCI故障排查全攻略】:掌握5大核心诊断步骤,快速恢复生产环境
  • DVWA安全测试平台对比?Hunyuan-MT-7B主打的是功能而非漏洞