当前位置：首页 > news >正文

GLM-4.1V-9B-Base部署教程：容器内Python API调用方式与requests示例

news 2026/7/29 1:18:22

GLM-4.1V-9B-Base部署教程：容器内Python API调用方式与requests示例

1. 模型简介

GLM-4.1V-9B-Base是智谱开源的视觉多模态理解模型，专注于图像内容识别与分析任务。这个9B参数规模的模型在中文视觉理解方面表现出色，能够准确识别图片内容、描述场景特征，并回答与图像相关的各种问题。

模型的核心优势在于：

原生支持中文视觉问答
对复杂场景有较强的理解能力
能够识别图片中的主体对象和细节特征
支持多种视觉理解任务

2. 环境准备

2.1 基础环境要求

在开始API调用前，请确保你的环境满足以下要求：

Python 3.8或更高版本
已安装Docker和docker-compose
至少16GB可用内存
支持CUDA的NVIDIA GPU（推荐RTX 3090或更高）

2.2 安装必要依赖

pip install requests pillow numpy

3. API调用基础

3.1 服务启动

首先需要启动GLM-4.1V-9B-Base的容器服务：

docker-compose up -d

服务启动后，API默认监听7860端口。你可以通过以下命令检查服务状态：

curl http://localhost:7860/health

3.2 基础请求结构

使用Python的requests库调用API的基本结构如下：

import requests from PIL import Image import io # 准备图片数据 image_path = "example.jpg" image = Image.open(image_path) img_byte_arr = io.BytesIO() image.save(img_byte_arr, format='JPEG') img_byte_arr = img_byte_arr.getvalue() # 构造请求 url = "http://localhost:7860/api/v1/analyze" files = {'image': ('example.jpg', img_byte_arr, 'image/jpeg')} data = {'question': '请描述这张图片的内容'} response = requests.post(url, files=files, data=data) print(response.json())

4. 实用调用示例

4.1 图片内容描述

def describe_image(image_path): image = Image.open(image_path) img_byte_arr = io.BytesIO() image.save(img_byte_arr, format='JPEG') response = requests.post( "http://localhost:7860/api/v1/analyze", files={'image': (image_path, img_byte_arr.getvalue(), 'image/jpeg')}, data={'question': '请详细描述这张图片的内容'} ) if response.status_code == 200: return response.json()['answer'] else: raise Exception(f"API请求失败: {response.text}") # 使用示例 description = describe_image("landscape.jpg") print(f"图片描述: {description}")

4.2 视觉问答示例

def visual_qa(image_path, question): with open(image_path, 'rb') as img_file: response = requests.post( "http://localhost:7860/api/v1/analyze", files={'image': (image_path, img_file, 'image/jpeg')}, data={'question': question} ) if response.status_code == 200: return response.json() else: raise Exception(f"请求失败: {response.status_code}") # 使用示例 result = visual_qa("product.jpg", "这张图片中的产品是什么颜色的?") print(f"回答: {result['answer']}")

5. 高级调用技巧

5.1 批量图片处理

def batch_process(image_paths, questions): results = [] for img_path, question in zip(image_paths, questions): try: with open(img_path, 'rb') as img_file: response = requests.post( "http://localhost:7860/api/v1/analyze", files={'image': (img_path, img_file, 'image/jpeg')}, data={'question': question}, timeout=30 ) results.append(response.json()) except Exception as e: results.append({'error': str(e)}) return results # 使用示例 images = ["img1.jpg", "img2.jpg", "img3.jpg"] questions = [ "图片中有什么物体?", "这张图片的主要颜色是什么?", "请用一句话描述这张图片" ] batch_results = batch_process(images, questions)

5.2 带参数的请求

def analyze_with_params(image_path, question, max_tokens=100, temperature=0.7): with open(image_path, 'rb') as img_file: response = requests.post( "http://localhost:7860/api/v1/analyze", files={'image': (image_path, img_file, 'image/jpeg')}, data={ 'question': question, 'max_tokens': str(max_tokens), 'temperature': str(temperature) } ) return response.json() # 使用示例 result = analyze_with_params( "artwork.jpg", "请分析这幅艺术作品的风格特点", max_tokens=150, temperature=0.5 )