当前位置：首页 > news >正文

SenseVoice-small-onnx REST API安全接入：JWT鉴权与请求限流配置指南

news 2026/3/26 15:31:37

SenseVoice-small-onnx REST API安全接入：JWT鉴权与请求限流配置指南

1. 服务概述

SenseVoice-small-onnx是基于ONNX量化的多语言语音识别服务，支持中文、粤语、英语、日语、韩语等多种语言的自动识别。该服务通过REST API提供高效的语音转写能力，10秒音频推理仅需70毫秒。

核心优势：

轻量级量化模型（230M）
自动语言检测（支持50+种语言）
富文本转写（含情感识别和音频事件检测）
简单易用的HTTP接口

2. 基础API部署

2.1 环境准备

# 安装依赖 pip install funasr-onnx gradio fastapi uvicorn soundfile jieba

2.2 启动基础服务

python3 app.py --host 0.0.0.0 --port 7860

启动后可通过以下地址访问：

Web界面：http://localhost:7860
API文档：http://localhost:7860/docs
健康检查：http://localhost:7860/health

3. JWT鉴权配置

3.1 为什么需要JWT鉴权

开放API接口存在被恶意滥用的风险。JWT(JSON Web Token)提供了一种轻量级的身份验证机制，确保只有授权用户能够访问API服务。

3.2 安装JWT依赖

pip install python-jose[cryptography] passlib[bcrypt]

3.3 修改FastAPI应用代码

在app.py中添加以下JWT相关代码：

from fastapi import Depends, HTTPException, status from fastapi.security import OAuth2PasswordBearer from jose import JWTError, jwt from passlib.context import CryptContext # 安全配置 SECRET_KEY = "your-secret-key-here" # 生产环境应从环境变量获取 ALGORITHM = "HS256" ACCESS_TOKEN_EXPIRE_MINUTES = 30 pwd_context = CryptContext(schemes=["bcrypt"], deprecated="auto") oauth2_scheme = OAuth2PasswordBearer(tokenUrl="token") # 用户验证逻辑 def verify_password(plain_password, hashed_password): return pwd_context.verify(plain_password, hashed_password) def create_access_token(data: dict): to_encode = data.copy() expire = datetime.utcnow() + timedelta(minutes=ACCESS_TOKEN_EXPIRE_MINUTES) to_encode.update({"exp": expire}) encoded_jwt = jwt.encode(to_encode, SECRET_KEY, algorithm=ALGORITHM) return encoded_jwt # 保护API端点 async def get_current_user(token: str = Depends(oauth2_scheme)): credentials_exception = HTTPException( status_code=status.HTTP_401_UNAUTHORIZED, detail="无法验证凭据", headers={"WWW-Authenticate": "Bearer"}, ) try: payload = jwt.decode(token, SECRET_KEY, algorithms=[ALGORITHM]) username: str = payload.get("sub") if username is None: raise credentials_exception except JWTError: raise credentials_exception return username

3.4 保护API端点

修改转写API端点，添加JWT验证：

@app.post("/api/transcribe") async def transcribe( file: UploadFile = File(...), language: str = "auto", use_itn: bool = True, current_user: str = Depends(get_current_user) ): # 原有转写逻辑 ...

4. 请求限流配置

4.1 为什么需要限流

限流可以防止API被过度调用，保护服务稳定性。常见的限流策略包括：

基于IP的限流
基于用户的限流
全局速率限制

4.2 安装限流依赖

pip install slowapi

4.3 配置限流中间件

在app.py中添加限流配置：

from slowapi import Limiter from slowapi.util import get_remote_address limiter = Limiter(key_func=get_remote_address) app.state.limiter = limiter # 全局限流配置 app.add_middleware( SlowAPIMiddleware, limiter=limiter, default_limits=["100 per minute", "10 per second"] ) # 为特定端点设置自定义限流 @app.post("/api/transcribe") @limiter.limit("5/minute") async def transcribe(...): ...

5. 完整安全配置示例

5.1 安全API调用流程

获取访问令牌
使用令牌调用受保护API
遵守速率限制

5.2 获取JWT令牌

curl -X POST "http://localhost:7860/token" \ -H "Content-Type: application/x-www-form-urlencoded" \ -d "username=your_username&password=your_password"

5.3 使用令牌调用API

curl -X POST "http://localhost:7860/api/transcribe" \ -H "Authorization: Bearer your_token_here" \ -F "file=@audio.wav" \ -F "language=auto" \ -F "use_itn=true"