当前位置：首页 > news >正文

LFM2.5-VL-1.6B作品分享：葡萄酒酒标图→产区识别+年份判断+品鉴笔记生成

news 2026/7/13 9:56:51

LFM2.5-VL-1.6B作品分享：葡萄酒酒标图→产区识别+年份判断+品鉴笔记生成

1. 项目概述

LFM2.5-VL-1.6B是Liquid AI发布的一款轻量级多模态模型，专为端侧和边缘设备设计。这款模型结合了1.2B参数的语言模型和约400M参数的视觉模型，能够在低显存环境下快速响应。

项目	值
模型名称	LFM2.5-VL-1.6B
开发商	Liquid AI
参数量	1.6B
类型	视觉语言模型 (Vision-Language)
模型路径	`/root/ai-models/LiquidAI/LFM2___5-VL-1___6B`
WebUI 地址	http://localhost:7860

2. 硬件要求与快速启动

2.1 硬件配置要求

这款模型对硬件要求相对友好，适合在普通工作站上运行：

组件	要求
GPU	NVIDIA GPU (推荐 8GB+ 显存)
当前配置	RTX 4090 D, 22.15 GB 可用
内存占用	~3 GB GPU

2.2 快速启动方式

WebUI方式（推荐）

# 查看服务状态 supervisorctl status lfm-vl # 重启服务 supervisorctl restart lfm-vl # 查看日志 tail -f /var/log/lfm-vl.out.log

启动后访问 http://localhost:7860 即可使用Web界面。

命令行方式

cd /root/LFM2.5-VL-1.6B python webui.py

3. 葡萄酒酒标识别实战

3.1 准备图片输入

我们可以使用本地图片或网络图片作为输入：

from PIL import Image from transformers.image_utils import load_image # 本地图片 local_image = Image.open("wine_label.jpg").convert('RGB') # 网络图片 url_image = load_image("https://example.com/wine_label.jpg")

3.2 产区识别与年份判断

conversation = [ { "role": "user", "content": [ {"type": "image", "image": wine_image}, {"type": "text", "text": "请识别这款葡萄酒的产区和年份"} ] } ] # 生成回复 text = processor.apply_chat_template( conversation, add_generation_prompt=True, tokenize=False, ) inputs = processor.tokenizer( text, return_tensors="pt", padding=True, truncation=True, max_length=2048, ) inputs = {k: v.to(model.device) for k, v in inputs.items()} with torch.no_grad(): outputs = model.generate( **inputs, max_new_tokens=256, temperature=0.1, min_p=0.15, do_sample=True, ) response = processor.batch_decode(outputs, skip_special_tokens=True)[0].strip() print(response)

3.3 品鉴笔记生成

conversation = [ { "role": "user", "content": [ {"type": "image", "image": wine_image}, {"type": "text", "text": "请为这款葡萄酒生成专业的品鉴笔记，包括色泽、香气、口感和余味"} ] } ] # 使用更富创意的参数 with torch.no_grad(): outputs = model.generate( **inputs, max_new_tokens=512, temperature=0.7, min_p=0.15, do_sample=True, )

4. 效果展示与案例分析

4.1 实际案例展示

我们测试了几款不同产区的葡萄酒，模型表现如下：

法国波尔多红酒
- 准确识别：Château Margaux 2015
- 品鉴笔记：深宝石红色，带有黑醋栗、雪松和烟草的复杂香气，单宁柔顺，余味悠长
意大利巴罗洛
- 准确识别：Barolo DOCG 2016
- 品鉴笔记：石榴红色，玫瑰花瓣和樱桃的香气，高酸度，单宁强劲，适合陈年
美国纳帕谷赤霞珠
- 准确识别：Napa Valley Cabernet Sauvignon 2018
- 品鉴笔记：深紫色，黑莓和香草的浓郁香气，酒体饱满，单宁成熟

4.2 生成参数建议

针对葡萄酒识别任务，推荐以下生成参数：

任务	temperature	min_p	max_new_tokens
产区年份识别	0.1	0.15	256
品鉴笔记生成	0.7	0.15	512
餐酒搭配建议	0.5	0.2	384

5. 高级功能与应用扩展

5.1 多语言支持

模型支持多种语言的问答和生成：

# 法语提问 conversation = [ { "role": "user", "content": [ {"type": "image", "image": wine_image}, {"type": "text", "text": "Décrivez ce vin en français"} ] } ] # 日语提问 conversation = [ { "role": "user", "content": [ {"type": "image", "image": wine_image}, {"type": "text", "text": "このワインについて日本語で説明してください"} ] } ]

5.2 批量处理与自动化

可以结合Python脚本实现批量酒标识别：

import os from tqdm import tqdm wine_dir = "wine_labels/" results = [] for filename in tqdm(os.listdir(wine_dir)): if filename.endswith((".jpg", ".png")): image = Image.open(os.path.join(wine_dir, filename)).convert('RGB') # 构建对话并生成结果 # 将结果保存到results列表或文件中