基于Spring AI實現多輪對話系統架構設計

前言

一、多輪對話系統核心架構

1.1 架構概覽

1.2 Spring AI核心優勢

二、ChatClient與多輪對話設計

2.1 ChatClient的特性與角色

2.2 實現多輪對話方法

三、Advisors攔截器機制

3.1 Advisors概念與工作原理

3.2 對話記憶Advisor詳解

四、對話記憶實現方案

4.1 ChatMemory接口

4.2 內存存儲實現

4.3 文件持久化存儲

4.4 數據庫持久化實現

五、自定義增強Advisor實現

5.1 日志記錄Advisor

5.2 內容安全Advisor

5.3 推理增強Advisor

六、結構化輸出與業務應用

6.1 結構化報告生成

6.2 完整對話系統整合

七、性能優化與最佳實踐

7.1 對話記憶性能優化

7.2 多輪對話系統設計要點

總結

基于Spring AI實現多輪對話系統架構設計

前言

隨著大型語言模型（LLM）的迅速發展，構建具有持久記憶和上下文感知能力的對話系統成為AI應用開發的關鍵。Spring AI框架提供了簡潔高效的組件，幫助開發者快速實現這類功能。本文將深入探討如何基于Spring AI打造多輪對話系統，包括架構設計、核心組件和實現方法。

一、多輪對話系統核心架構

1.1 架構概覽

多輪對話系統的核心在于維護對話上下文，使AI能夠理解歷史交互內容，實現連貫對話。基于Spring AI的多輪對話系統架構可概括為：

核心組件包括：

ChatClient：對話客戶端，處理與LLM的交互
Advisor：攔截器鏈，處理請求前后的增強邏輯
ChatMemory：對話記憶，存儲歷史消息
PromptTemplate：提示詞模板，構建結構化提示

1.2 Spring AI核心優勢

基于Spring AI框架開發多輪對話系統具有以下優勢：

鏈式調用API（Fluent API）：簡潔直觀的調用方式
動態參數綁定：支持模板變量，提高靈活性
靈活的響應格式：支持實體映射和流式輸出
可插拔攔截器：通過Advisors機制輕松擴展功能
內置記憶管理：開箱即用的對話記憶組件

二、ChatClient與多輪對話設計

2.1 ChatClient的特性與角色

@Component
@Slf4j
public class LoveApp {
?private static final String SYSTEM_PROMPT = "**戀愛大師·情感導航員**  \n" + "10年情感咨詢經驗，擅長親密關系理論與溝通技巧...";private final ChatClient chatClient;
?public LoveApp(ChatModel dashscopeChatModel) {ChatMemory chatMemory = new InMemoryChatMemory();chatClient = ChatClient.builder(dashscopeChatModel).defaultSystem(SYSTEM_PROMPT).defaultAdvisors(new MessageChatMemoryAdvisor(chatMemory)).build();}
}

ChatClient是Spring AI提供的核心對話客戶端組件，負責維護與大語言模型的通信。它支持：

鏈式調用：簡化API調用流程
動態參數注入：運行時傳遞控制參數
多格式響應處理：文本、JSON、流式回復等
攔截器機制：Advisors模式的擴展點

創建ChatClient示例：

2.2 實現多輪對話方法

通過ChatClient實現多輪對話的關鍵是正確配置對話記憶，并在每次交互中保持會話ID的一致性：

public String doChat(String message, String chatId) {ChatResponse response = chatClient.prompt().user(message).advisors(spec -> spec.param(CHAT_MEMORY_CONVERSATION_ID_KEY, chatId).param(CHAT_MEMORY_RETRIEVE_SIZE_KEY, 10)).call().chatResponse();String content = response.getResult().getOutput().getText();log.info("content: {}", content);return content;
}

這段代碼的關鍵點在于：

用戶消息注入：通過.user(message)添加當前用戶問題
會話標識：通過CHAT_MEMORY_CONVERSATION_ID_KEY參數維護會話一致性
上下文長度：通過CHAT_MEMORY_RETRIEVE_SIZE_KEY控制歷史消息檢索數量
響應處理：獲取并處理模型回復

三、Advisors攔截器機制

3.1 Advisors概念與工作原理

Advisors是Spring AI中基于責任鏈模式實現的攔截器機制，可以在調用大模型前后執行增強邏輯。其核心特性包括：

責任鏈模式：多個攔截器按順序執行
順序控制：通過getOrder()方法控制執行順序
前置/后置處理：可在請求發送前和響應接收后進行處理
可擴展性：通過實現接口自定義攔截器

常用內置Advisor：

MessageChatMemoryAdvisor：維護對話歷史
QuestionAnswerAdvisor：知識庫檢索增強

3.2 對話記憶Advisor詳解

負責維護對話上下文的攔截器主要有兩種：

MessageChatMemoryAdvisor：
- 保留消息的角色（用戶/助手/系統）
- 將歷史消息作為獨立實體注入
- 維護完整對話結構
PromptChatMemoryAdvisor：
- 將歷史對話合并為文本
- 作為系統提示的一部分注入
- 可能丟失消息邊界信息

在多輪對話系統中，通常選擇MessageChatMemoryAdvisor以保留更多上下文信息：

ChatMemory chatMemory = new InMemoryChatMemory();
chatClient = ChatClient.builder(dashscopeChatModel).defaultSystem(SYSTEM_PROMPT).defaultAdvisors(new MessageChatMemoryAdvisor(chatMemory)).build();

四、對話記憶實現方案

4.1 ChatMemory接口

ChatMemory是Spring AI提供的抽象接口，定義了對話記憶的核心操作：

public interface ChatMemory {void add(String conversationId, List<Message> messages);List<Message> get(String conversationId, int lastN);void clear(String conversationId);
}

這三個方法構成了對話記憶的基本功能：

添加：保存新的對話消息
獲取：檢索特定會話的歷史消息
清空：刪除特定會話的所有記錄

4.2 內存存儲實現

最簡單的實現是使用InMemoryChatMemory，適用于開發測試或短期會話：

ChatMemory chatMemory = new InMemoryChatMemory();

內存實現的優缺點：

優點：速度快，配置簡單
缺點：服務重啟數據丟失，不適合生產環境

4.3 文件持久化存儲

對于需要跨服務重啟保存對話的場景，可以實現基于文件的持久化：

@Slf4j
public class FileBasedChatMemory implements ChatMemory {
?private final String baseDir;private static final Kryo kryo;
?static {kryo = new Kryo();kryo.setRegistrationRequired(false);kryo.setInstantiatorStrategy(new StdInstantiatorStrategy());}
?public FileBasedChatMemory(String dir) {this.baseDir = dir;new File(dir).mkdirs();}
?@Overridepublic void add(String conversationId, List<Message> messages) {var existingMessages = getOrCreateConversation(conversationId);existingMessages.addAll(messages);saveConversation(conversationId, existingMessages);}
?@Overridepublic List<Message> get(String conversationId, int lastN) {var allMessages = getOrCreateConversation(conversationId);return allMessages.stream().skip(Math.max(0, allMessages.size() - lastN)).toList();}
?// 其他輔助方法...
}

文件存儲的優缺點：

優點：簡單易實現，無需額外服務
缺點：并發性能有限，不適合高并發場景

4.4 數據庫持久化實現

對于生產環境，尤其是多用戶系統，數據庫存儲是最佳選擇。以MySQL為例：

@Component
@Slf4j
public class MySQLChatMemory implements ChatMemory {
?private final JdbcTemplate jdbcTemplate;private final JSONConfig jsonConfig;
?public MySQLChatMemory(DataSource dataSource) {this.jdbcTemplate = new JdbcTemplate(dataSource);this.jsonConfig = new JSONConfig().setIgnoreNullValue(true);log.info("初始化MySQL對話記憶");}
?@Override@Transactionalpublic void add(String conversationId, List<Message> messages) {// 獲取當前最大序號Integer maxOrder = getMaxOrder(conversationId).orElse(0);int nextOrder = maxOrder + 1;
?// 批量插入消息String insertSql = "INSERT INTO chatmemory (...) VALUES (?, ?, ?, ?, ?, ?, ?, ?)";jdbcTemplate.batchUpdate(insertSql, messages, messages.size(), (ps, message) -> {// 設置參數...});}
?// 其他實現方法...
}

對于更復雜的場景，可以集成ORM框架如MyBatis-Plus：

@Component
@Slf4j
public class MybatisPlusChatMemory implements ChatMemory {
?private final ChatMemoryService chatMemoryService;
?public MybatisPlusChatMemory(ChatMemoryService chatMemoryService) {this.chatMemoryService = chatMemoryService;log.info("初始化Mybatis-Plus對話記憶");}
?@Overridepublic void add(String conversationId, List<Message> messages) {chatMemoryService.addMessages(conversationId, messages);}
?@Overridepublic List<Message> get(String conversationId, int lastN) {return chatMemoryService.getMessages(conversationId, lastN);}
?@Overridepublic void clear(String conversationId) {chatMemoryService.clearMessages(conversationId);}
}

五、自定義增強Advisor實現

5.1 日志記錄Advisor

記錄對話內容的自定義Advisor實現：

@Slf4j
public class MyLoggerAdvisor implements CallAroundAdvisor, StreamAroundAdvisor {
?@Overridepublic String getName() {return getClass().getSimpleName();}
?@Overridepublic int getOrder() {return 0; // 執行順序}
?// 請求前打印用戶輸入private AdvisedRequest before(AdvisedRequest request) {log.info("AI Request: {}", request.userText());return request;}
?// 響應后打印 AI 輸出private void observeAfter(AdvisedResponse response) {log.info("AI Response: {}", response.response().getResult().getOutput().getText());}
?@Overridepublic AdvisedResponse aroundCall(AdvisedRequest req, CallAroundAdvisorChain chain) {req = before(req);AdvisedResponse res = chain.nextAroundCall(req);observeAfter(res);return res;}
?// 流式調用處理@Overridepublic Flux<AdvisedResponse> aroundStream(AdvisedRequest req, StreamAroundAdvisorChain chain) {req = before(req);Flux<AdvisedResponse> res = chain.nextAroundStream(req);return new MessageAggregator().aggregateAdvisedResponse(res, this::observeAfter);}
}

5.2 內容安全Advisor

檢測違禁詞的自定義Advisor實現：

5.3 推理增強Advisor

@Slf4j
public class ProhibitedWordAdvisor implements CallAroundAdvisor, StreamAroundAdvisor {
?private static final String DEFAULT_PROHIBITED_WORDS_FILE = "prohibited-words.txt";private final List<String> prohibitedWords;
?public ProhibitedWordAdvisor() {this.prohibitedWords = loadProhibitedWordsFromFile(DEFAULT_PROHIBITED_WORDS_FILE);log.info("初始化違禁詞Advisor，違禁詞數量: {}", prohibitedWords.size());}
?private AdvisedRequest checkRequest(AdvisedRequest request) {String userText = request.userText();if (containsProhibitedWord(userText)) {log.warn("檢測到違禁詞在用戶輸入中: {}", userText);throw new ProhibitedWordException("用戶輸入包含違禁詞");}return request;}
?@Overridepublic AdvisedResponse aroundCall(AdvisedRequest advisedRequest, CallAroundAdvisorChain chain) {return chain.nextAroundCall(checkRequest(advisedRequest));}
?// 其他輔助方法...
}

提高模型推理能力的自定義Advisor：

public class ReReadingAdvisor implements CallAroundAdvisor, StreamAroundAdvisor {
?private AdvisedRequest before(AdvisedRequest advisedRequest) {Map<String, Object> advisedUserParams = new HashMap<>(advisedRequest.userParams());advisedUserParams.put("re2_input_query", advisedRequest.userText());
?return AdvisedRequest.from(advisedRequest).userText("""{re2_input_query}Read the question again: {re2_input_query}""").userParams(advisedUserParams).build();}
?@Overridepublic AdvisedResponse aroundCall(AdvisedRequest advisedRequest, CallAroundAdvisorChain chain) {return chain.nextAroundCall(this.before(advisedRequest));}
?// 其他方法實現...
}

六、結構化輸出與業務應用

6.1 結構化報告生成

Spring AI支持將模型輸出直接映射為Java對象，實現結構化數據處理：

public LoveReport doChatWithReport(String message, String chatId) {LoveReport loveReport = chatClient.prompt().system(SYSTEM_PROMPT + "每次對話后都要生成戀愛結果，標題為{用戶名}的戀愛報告，內容為建議列表").user(message).advisors(spec -> spec.param(CHAT_MEMORY_CONVERSATION_ID_KEY, chatId).param(CHAT_MEMORY_RETRIEVE_SIZE_KEY, 10)).call().entity(LoveReport.class);log.info("loveReport: {}", loveReport);return loveReport;
}

這種方式可以直接將模型生成的內容解析為Java對象，便于后續業務處理。

6.2 完整對話系統整合

將所有組件整合的多輪對話系統示例：

@Component
@Slf4j
public class EnhancedLoveApp {
?private static final String SYSTEM_PROMPT = "**戀愛大師·情感導航員**...";private final ChatClient chatClient;
?public EnhancedLoveApp(ChatModel dashscopeChatModel, ChatMemoryService chatMemoryService) {// 使用MyBatis-Plus持久化對話記憶ChatMemory chatMemory = new MybatisPlusChatMemory(chatMemoryService);chatClient = ChatClient.builder(dashscopeChatModel).defaultSystem(SYSTEM_PROMPT).defaultAdvisors(// 對話記憶能力new MessageChatMemoryAdvisor(chatMemory),// 記錄日志new MyLoggerAdvisor(),// 違禁詞檢測new ProhibitedWordAdvisor(),// 復讀強化閱讀能力new ReReadingAdvisor()).build();}
?public String doChat(String message, String chatId) {return chatClient.prompt().user(message).advisors(spec -> spec.param(CHAT_MEMORY_CONVERSATION_ID_KEY, chatId).param(CHAT_MEMORY_RETRIEVE_SIZE_KEY, 10)).call().chatResponse().getResult().getOutput().getText();}// 其他業務方法...
}