Baidu Open-Sources Its Large Model
Local Startup Guide
Install dependencies:
python -m pip install paddlepaddle-gpu==3.1.0 -i https://www.paddlepaddle.org.cn/packages/stable/cu126/
python -m pip install fastdeploy-gpu -i https://www.paddlepaddle.org.cn/packages/stable/fastdeploy-gpu-80_90/ --extra-index-url https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple
apt install libgomp1
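Before starting the server, it can help to confirm that both packages are visible to the Python interpreter you plan to use. The following is a minimal sketch, not an official check: it assumes the import names are `paddle` (for paddlepaddle-gpu) and `fastdeploy` (as used in the launch command in this guide), and the helper name `check_installed` is made up for illustration. It only verifies that the modules can be found, not that the GPU works.

```python
import importlib.util


def check_installed(module_names):
    """Return {module_name: bool}, True when the top-level module can be located."""
    return {
        name: importlib.util.find_spec(name) is not None
        for name in module_names
    }


# Assumed import names: "paddle" and "fastdeploy".
for name, found in check_installed(["paddle", "fastdeploy"]).items():
    print(f"{name}: {'found' if found else 'MISSING'}")
```

If either line prints MISSING, the pip install most likely ran under a different Python environment than the one you are checking.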
Start the API
python -m fastdeploy.entrypoints.openai.api_server \
  --model baidu/ERNIE-4.5-0.3B-Paddle \
  --port 8180 \
  --metrics-port 8181 \
  --engine-worker-queue-port 8182 \
  --max-model-len 32768 \
  --max-num-seqs 32
Wait roughly 10 minutes for the server to finish starting.
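Instead of watching the clock, you can poll the API port until it accepts connections. This is a rough sketch using only the standard library; the function name `wait_for_port` and the default timeout and interval are assumptions chosen for illustration. An open port only shows that the server process is listening, so the first request may still be slow.

```python
import socket
import time


def wait_for_port(host, port, timeout_s=900.0, interval_s=5.0):
    """Poll until host:port accepts a TCP connection.

    Returns True once a connection succeeds, or False if timeout_s
    elapses first.
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=2.0):
                return True
        except OSError:
            # Connection refused or timed out: wait and try again.
            time.sleep(interval_s)
    return False


# Example call for the API port used in this guide:
# wait_for_port("127.0.0.1", 8180)
```

The 15-minute default leaves some margin over the 10-minute estimate; adjust it to your hardware.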
Test with Python
import json

import requests

url = "http://127.0.0.1:8180/v1/chat/completions"
headers = {"Content-Type": "application/json"}
data = {
    "model": "baidu/ERNIE-4.5-0.3B-PT",
    "messages": [
        {
            "role": "user",
            "content": "I am 仕宇2050. Please write a Python function that checks whether a string is a palindrome.",
        }
    ],
    "temperature": 0.7,
}

# Send the chat request and print the text of the first reply.
response = requests.post(url, headers=headers, data=json.dumps(data))
print(response.json()["choices"][0]["message"]["content"])
Open-source model page: https://ai.gitcode.com/theme/1939325484087291906
Video demo: same name on every platform, #JavaPub #仕宇2050
Python code for a multi-turn chat loop
import json

import requests


def chat():
    url = "http://127.0.0.1:8180/v1/chat/completions"
    headers = {"Content-Type": "application/json"}
    model = "baidu/ERNIE-4.5-0.3B-PT"

    # Conversation history, sent in full with every request
    messages = []

    print("Start chatting! (type 'exit' or 'quit' to leave)")

    while True:
        user_input = input("\nYou: ")
        if user_input.lower() in {"exit", "quit"}:
            print("Conversation ended.")
            break

        # Add the user's message to the history
        messages.append({"role": "user", "content": user_input})

        # Build the request body
        data = {"model": model, "messages": messages}

        try:
            response = requests.post(url, headers=headers, data=json.dumps(data))
            response.raise_for_status()
            result = response.json()

            if "choices" in result and len(result["choices"]) > 0:
                ai_message = result["choices"][0]["message"]["content"]
                print("AI:", ai_message)
                # Add the AI's reply to the history
                messages.append({"role": "assistant", "content": ai_message})
            else:
                print("The AI returned no content.")
        # Checked before RequestException: in recent versions of requests,
        # the JSON error raised by response.json() is also a RequestException.
        except json.JSONDecodeError:
            print(f"JSON parsing error, response body: {response.text}")
        except requests.exceptions.RequestException as e:
            print(f"Request error: {e}")
        except Exception as e:
            print(f"An error occurred: {e}")


if __name__ == "__main__":
    chat()
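The loop above resends the whole history on every turn, so a long conversation will eventually exceed the 32768 limit set with --max-model-len. One simple way to keep the request bounded is to send only the most recent messages. The sketch below is an illustration, not part of the original script: `trim_history` is a hypothetical helper, and it budgets by character count as a crude stand-in for tokens (the 8000 default is an arbitrary assumption, not a measured value).

```python
def trim_history(messages, max_chars=8000):
    """Return the most recent messages whose combined content fits max_chars.

    The newest message is always kept, even if it alone exceeds the budget.
    Character count is only a rough proxy for the model's token count.
    """
    kept = []
    total = 0
    # Walk backwards from the newest message and stop once the budget is hit.
    for msg in reversed(messages):
        size = len(msg["content"])
        if kept and total + size > max_chars:
            break
        kept.append(msg)
        total += size
    kept.reverse()  # restore chronological order
    return kept
```

To use it, build the request body with "messages": trim_history(messages) while still appending every turn to the full messages list. Note that the trimmed list may begin with an assistant message; if the model handles that poorly, drop that leading message as well.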