当前位置：首页 > news >正文

使用Flask构建StructBERT情感分类模型Web服务

news 2026/7/1 20:20:10

使用Flask构建StructBERT情感分类模型Web服务

1. 引言

你是不是遇到过这样的情况：手里有一个很好用的AI模型，但每次使用都要打开Python环境，写一堆代码才能调用？或者想把这个模型分享给同事使用，但他们不懂技术，不知道怎么调用？今天我就来分享一个简单实用的解决方案——用Flask为StructBERT情感分类模型构建一个轻量级Web服务。

StructBERT情感分类模型是一个很实用的中文情感分析工具，它能判断一段文字是正面还是负面情绪。比如"这家餐厅的菜真好吃，服务也很周到"会被识别为正面，而"启动的时候很大声音，然后就会听到卡察的声音"则会被识别为负面。通过Web服务的方式，我们可以让这个模型更容易被使用，无论是通过网页界面还是API调用都很方便。

接下来，我会手把手教你如何搭建这个服务，即使你是Flask新手也能轻松跟上。我们会从环境准备开始，一步步实现一个完整的Web应用。

2. 环境准备与模型部署

在开始写代码之前，我们需要先把必要的环境搭建好。这个过程很简单，跟着我做就行。

首先安装必需的Python包：

pip install flask modelscope torch transformers

Flask是我们用来构建Web框架的核心，modelscope提供了StructBERT模型的接口，torch和transformers是模型运行的基础依赖。

安装完成后，我们来测试一下模型是否能正常使用。创建一个简单的测试脚本：

from modelscope.pipelines import pipeline from modelscope.utils.constant import Tasks # 创建情感分类管道 semantic_cls = pipeline( task=Tasks.text_classification, model='damo/nlp_structbert_sentiment-classification_chinese-base' ) # 测试一下 result = semantic_cls('这家餐厅的菜真好吃，服务也很周到') print(result)

如果运行正常，你会看到类似这样的输出：

{'label': 'positive', 'score': 0.9876}

这表示模型已经准备好了，能正确识别文本的情感倾向。label字段表示情感标签（positive或negative），score字段表示置信度分数。

3. Flask Web服务基础搭建

现在我们来创建Flask应用的基本结构。Flask是一个很轻量的Web框架，特别适合构建这种小型的API服务。

首先创建项目目录结构：

sentiment-app/ ├── app.py ├── templates/ │ └── index.html └── static/ └── style.css

然后编写主要的应用文件app.py：

from flask import Flask, render_template, request, jsonify from modelscope.pipelines import pipeline from modelscope.utils.constant import Tasks app = Flask(__name__) # 初始化模型管道 semantic_cls = pipeline( task=Tasks.text_classification, model='damo/nlp_structbert_sentiment-classification_chinese-base' ) @app.route('/') def home(): return render_template('index.html') if __name__ == '__main__': app.run(debug=True, host='0.0.0.0', port=5000)

这个基础版本创建了一个Flask应用，初始化了情感分类模型，并设置了一个首页路由。现在运行python app.py，访问http://localhost:5000就能看到网站了（虽然还没有内容）。

4. 实现情感分析API接口

接下来我们实现核心的API接口。这个接口接收文本输入，返回情感分析结果。

在app.py中添加新的路由：

@app.route('/api/sentiment', methods=['POST']) def analyze_sentiment(): try: # 获取请求数据 data = request.get_json() text = data.get('text', '') if not text: return jsonify({'error': '请输入文本内容'}), 400 # 进行情感分析 result = semantic_cls(text) # 格式化返回结果 response = { 'text': text, 'sentiment': result['label'], 'confidence': float(result['score']), 'status': 'success' } return jsonify(response) except Exception as e: return jsonify({'error': str(e), 'status': 'error'}), 500

这个API接口设计得很简单实用：

使用POST方法接收JSON数据
检查输入文本是否为空
调用模型进行情感分析
返回结构化的结果，包括原始文本、情感标签、置信度分数

你可以用curl命令测试这个接口：

curl -X POST http://localhost:5000/api/sentiment \ -H "Content-Type: application/json" \ -d '{"text":"这个产品质量很好，用起来很顺手"}'

应该会得到类似这样的响应：

{ "text": "这个产品质量很好，用起来很顺手", "sentiment": "positive", "confidence": 0.9567, "status": "success" }

5. 创建用户友好的Web界面

有了API接口后，我们再创建一个简单的网页界面，让用户可以通过浏览器直接使用情感分析功能。

在templates/index.html中创建页面：

<!DOCTYPE html> <html> <head> <title>情感分析工具</title> <link rel="stylesheet" href="{{ url_for('static', filename='style.css') }}"> </head> <body> <div class="container"> <h1>中文情感分析工具</h1> <p>输入一段中文文本，分析其情感倾向（正面/负面）</p> <textarea id="inputText" placeholder="请输入要分析的文本..."></textarea> <button onclick="analyzeSentiment()">分析情感</button> <div id="result" class="hidden"> <h3>分析结果</h3> <p>输入文本：<span id="originalText"></span></p> <p>情感倾向：<span id="sentiment"></span></p> <p>置信度：<span id="confidence"></span></p> </div> <div id="loading" class="hidden">分析中...</div> <div id="error" class="hidden error"></div> </div> <script> async function analyzeSentiment() { const text = document.getElementById('inputText').value.trim(); if (!text) { alert('请输入文本内容'); return; } // 显示加载中，隐藏其他状态 document.getElementById('loading').classList.remove('hidden'); document.getElementById('result').classList.add('hidden'); document.getElementById('error').classList.add('hidden'); try { const response = await fetch('/api/sentiment', { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ text: text }) }); const data = await response.json(); if (data.status === 'success') { // 显示结果 document.getElementById('originalText').textContent = data.text; document.getElementById('sentiment').textContent = data.sentiment; document.getElementById('confidence').textContent = (data.confidence * 100).toFixed(2) + '%'; document.getElementById('result').classList.remove('hidden'); // 根据情感倾向设置颜色 document.getElementById('sentiment').className = data.sentiment; } else { throw new Error(data.error || '分析失败'); } } catch (error) { document.getElementById('error').textContent = '错误：' + error.message; document.getElementById('error').classList.remove('hidden'); } finally { document.getElementById('loading').classList.add('hidden'); } } </script> </body> </html>

再添加一些基本的样式到static/style.css：

body { font-family: Arial, sans-serif; max-width: 800px; margin: 0 auto; padding: 20px; line-height: 1.6; } .container { background: #f9f9f9; padding: 20px; border-radius: 8px; } textarea { width: 100%; height: 100px; margin: 10px 0; padding: 10px; border: 1px solid #ddd; border-radius: 4px; } button { background: #007bff; color: white; padding: 10px 20px; border: none; border-radius: 4px; cursor: pointer; } button:hover { background: #0056b3; } .hidden { display: none; } .error { color: #dc3545; background: #f8d7da; padding: 10px; border-radius: 4px; margin: 10px 0; } .positive { color: #28a745; font-weight: bold; } .negative { color: #dc3545; font-weight: bold; }

现在刷新页面，你就能看到一个完整的情感分析工具了。输入文本，点击按钮，就能看到实时的分析结果。

6. 实用技巧与进阶功能

基础功能已经实现了，我再分享一些实用技巧和可以添加的进阶功能。

批量处理功能：有时候我们需要一次性分析多段文本，可以添加批量处理接口：

@app.route('/api/sentiment/batch', methods=['POST']) def analyze_sentiment_batch(): try: data = request.get_json() texts = data.get('texts', []) if not texts or not isinstance(texts, list): return jsonify({'error': '请输入文本列表'}), 400 results = [] for text in texts: if text.strip(): # 跳过空文本 result = semantic_cls(text.strip()) results.append({ 'text': text, 'sentiment': result['label'], 'confidence': float(result['score']) }) return jsonify({'results': results, 'status': 'success'}) except Exception as e: return jsonify({'error': str(e), 'status': 'error'}), 500

添加缓存机制：对于重复的文本，可以使用缓存提高性能：

from functools import lru_cache @lru_cache(maxsize=1000) def cached_sentiment_analysis(text): return semantic_cls(text) # 然后在API接口中使用缓存版本 result = cached_sentiment_analysis(text)

添加速率限制：防止API被滥用：

from flask_limiter import Limiter from flask_limiter.util import get_remote_address limiter = Limiter( app=app, key_func=get_remote_address, default_limits=["200 per day", "50 per hour"] ) # 给API接口添加限制 @app.route('/api/sentiment', methods=['POST']) @limiter.limit("10 per minute") def analyze_sentiment(): # 原有代码不变

添加健康检查接口：

@app.route('/health') def health_check(): try: # 简单测试一下模型是否正常工作 test_result = semantic_cls('测试') return jsonify({ 'status': 'healthy', 'model': 'working', 'timestamp': datetime.now().isoformat() }) except Exception as e: return jsonify({ 'status': 'unhealthy', 'error': str(e) }), 500

7. 部署与使用建议

开发完成后，你可能想要部署到服务器上。这里有一些建议：

生产环境部署：不要使用Flask自带的开发服务器部署生产环境，建议使用Gunicorn：

pip install gunicorn gunicorn -w 4 -b 0.0.0.0:5000 app:app

使用环境变量配置：创建.env文件管理配置：

import os from dotenv import load_dotenv load_dotenv() app.config['DEBUG'] = os.getenv('DEBUG', 'False').lower() == 'true' app.config['PORT'] = int(os.getenv('PORT', 5000))

添加日志记录：

import logging from logging.handlers import RotatingFileHandler # 配置日志 handler = RotatingFileHandler('app.log', maxBytes=10000, backupCount=1) handler.setLevel(logging.INFO) app.logger.addHandler(handler)

使用建议：