10.Agent 进阶实践：基于 Planner-Executor-Responder 的多步决策系统实现

news2026/4/9 17:05:38

目录从单步执行到多步决策与状态流转最小循环Agent代码实现工具选择planner执行工具executor执行输出responder从单步执行到多步决策与状态流转最近不知道大家有没有关注一些招聘平台的招聘信息其实看这些平台我相信最多的关键词就Cot、ReAct等今天咱们主要讲ReActReasoning Acting这种范式的提出主要目的是解决纯推理模型易产生“幻觉”和纯行动模型缺乏高层规划的局限性。这个说法有点抽象我想换一种说法这种方式其实是改变智能体的行为方式咱们前面写到东西其实是一种单步智能体他的执行过程是这样的用户问题 → 选一个工具 → 执行 → 返回结果这种方式其实有一点不好就是智能体判断完可能是什么任务后直接就开始执行得到结果就给了用户这种方式得到的结果质量可能一般所以为了提升智能体执行任务返回结果的质量姚顺雨在2020年提出的了这种AI范式其核心在于让大语言模型通过“推理-行动-观察”的循环与外部环境交互以完成复杂任务。‌可以这样表示用户问题 → 思考 → 调工具 → 看结果 → 再思考 → 再决定下一步 → 最终回答最小循环Agent今天我们要做的这样一件事让我们前面写的智能体先判断用什么工具调一次工具拿到结果后在做一次总结返回最终答案也就是实现状态节点流转Question → Tool Choice → Tool Output → Final Answer代码实现咱们需要修改咱们前面写的agent.py文件。工具选择planner首先对咱们的的工具选择进行一定的改造前面咱们的工具选择是从已定义的工具中选择合适的工具回答问题这次咱们再补充一点如果没有合适的工具那么使用大语言模型回答问题defchoose_tool(query,tools):tool_desc\n.join([f{t[name]}:{t[description]}fortintools])promptf You are an AI agent. Available tools:{tool_desc}User question:{query}Return JSON: {{tool: ..., input: ...}} responseclient.chat.completions.create(modeldeepseek-chat,messages[{role:user,content:prompt}])contentresponse.choices[0].message.contenttry:returnjson.loads(content)except:return{tool:llm,input:query}执行工具executor定义执行工具函数该函数将会上一步选择工具执行同时返回响应的中间执行结果defexecute_tool(decision,tools,ragNone):tool_namedecision[tool]tool_inputdecision[input]fortintools:ift[name]tool_name:iftool_namerag:resultt[func](tool_input,rag)else:resultt[func](tool_input)return{tool_name:tool_name,tool_input:tool_input,tool_output:result}return{tool_name:none,tool_input:tool_input,tool_output:No valid tool found.}执行输出responder咱们前面已经选择的工具并且得到了中间的执行结果随后智能体将会观察执行结果在执行结果的基础上再进行一步思考得出最终执行结果defgenerate_final_answer(query,tool_result):promptf You are an AI assistant The user asked:{query}A tool was used: Tool name:{tool_result[tool_name]}Tool input:{tool_result[tool_input]}Tool output:{tool_result[tool_output]}Now provide a final helpful answer to the user. responseclient.chat.completions.create(modeldeepseek-chat,messages[{role:user,content:prompt}])returnresponse.choices[0].message.content最后就是将整个流程穿起来咱们就得到一个多步决策智能体了defrun_agent(query,tools,ragNone):print( Agent Start )print(User query:,query)decisionchoose_tool(query,tools)print(Decision:,decision)tool_resultexecute_tool(decision,tools,rag)print(Tool result:,tool_result)final_answergenerate_final_answer(query,tool_result)print( Agent End )returnfinal_answer这里有个小坑咱们在选择工具的时候有写这样一段try:returnjson.loads(content)except:return{tool:llm,input:query}所以咱们还需要再tool.py文件中补一个工具llm_toolfromdatetimeimportdatetimefromllm_utilsimportclientdefrag_tool(query,rag):returnrag.ask(query)defcalculator_tool(expression):try:returnstr(eval(expression))except:returnInvalid expressiondeftime_tool(_):returndatetime.now().strftime(%Y-%m-%d %H:%M:%S)defllm_tool(query):responseclient.chat.completions.create(modeldeepseek-chat,messages[{role:system,content:You are a helpful assistant.},{role:user,content:query}])returnresponse.choices[0].message.content TOOLS[{name:rag,description:Use for paper/document questions,func:rag_tool},{name:calculator,description:Use for math calculations,func:calculator_tool},{name:time,description:Use to get current time,func:time_tool},{name:llm,description:Use for general questions,func:llm_tool}]至此咱们的代码升级完成前面的咱们的代码是选工具 → 直接返回工具结果到今天咱们的代码实现了选工具 → 执行工具 → 工具结果作为 observation → LLM 再组织最终答案如果这篇文章对你有帮助可以点个赞完整代码地址https://github.com/1186141415/A-Paper-Rag-Agent

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.coloradmin.cn/o/2500076.html

如若内容造成侵权/违法违规/事实不符，请联系多彩编程网进行投诉反馈，一经查实，立即删除！