MCP入门指南：AI领域“万能插座“详解，程序员必学必收藏

本文详细介绍了MCP（模型上下文协议）是Anthropic公司推出的开放标准协议，旨在解决大模型与外部工具、数据源的交互问题。MCP采用client-host-server架构，通过标准化通信实现AI应用与多样化外部资源的连接。文章解释了MCP的工作原理、架构组件、与LLM的交互流程，并提供了创建MCP服务器的示例。MCP servers提供工具、资源和提示三种功能，可大幅简化AI应用开发，增强A

AI产品经理

739人浏览 · 2026-01-14 16:00:15

AI产品经理 · 2026-01-14 16:00:15 发布

前排提示，文末有大模型AGI-CSDN独家资料包哦！

什么是MCP

MCP（Model Context Protocol，模型上下文协议）是 Anthropic 公司于 2024 年底推出的一种开放标准协议，旨在解决大型语言模型（LLM）与外部工具、数据源之间的交互壁垒。MCP其核心定位是成为 AI 领域的“通用接口”或“万能插座”，通过标准化通信实现大模型与多样化外部资源（如数据库、文件系统、API 等）的安全、高效连接。

官方的解释:

“The Model Context Protocol (MCP) follows a client-host-server architecture.”

MCP Server本质上就是一段Nodejs或者Python程序，大模型通过操作系统的STDIO，也就是标准输入输出通道，或者SSE协议调用某个MCP Server。

这里顺便提下Function Calling、MCP和AI Agent的区别和定位，如下

Function Calling：是AI模型调用函数的机制。

MCP：是一个标准协议，使大模型与API无缝交互。

AI Agent：是一个自主运行的智能系统，利用Function Calling和MCP来分析和执行任务，实现特定目标。

MCP是如何工作的

想要更深入知道里面的工作原理，看官方文档和源码

源码仓库：https://github.com/modelcontextprotocol[1]

官方文档：https://modelcontextprotocol.io/[2]

MCP提供了多种语言的sdk，大家可以根据自己的语言特长选择相应的sdk

总体架构图

●

MCP Hosts: 如 Claude Desktop、IDE 或 AI 工具等希望通过 MCP 访问数据的程序。

●

MCP Clients:与服务器保持 1:1 连接的协议客户端。

●

MCP Servers: 通过标准化的模型上下文协议暴露特定功能的轻量级程序。

●

Local Data Sources: 你的计算机文件、数据库和服务，MCP 服务器可以安全访问。

●

Remote Services: 可通过互联网（例如通过 API）访问的外部系统，MCP 服务器可与之连接。

MCP和大模型的交互流程

MCP是核心是提供可插拔的多个工具支持LLM来调用，那么问题来了，LLM（模型）是在什么时候确定使用哪些工具的呢？ Anthropic 为我们提供了详细的解释，当用户在mcp客户端发起一个问题时：

客户端（Claude Desktop / Cursor）将问题发送给 LLM。

LLM 分析可用的工具，并决定使用哪一个（或多个）。

客户端通过 MCP Server 执行所选的工具。

工具的执行结果被送回给 LLM。

LLM 结合执行结果，归纳总结后得到prompt并生成自然语言展示给用户！

通过MCP client端的示例代码我们也可以大概知道情况

Python

async def start(self):
    # 初始化配置的 mcp server
    for server in self.servers:
        await server.initialize()

    # 获取配置的每一个mcp server的 tools 命名为 all_tools
    all_tools = []
    for server in self.servers:
        tools = await server.list_tools()
        all_tools.extend(tools)

    # 获取tool上的提示词并提供 给LLM 使用
    tools_description = "\n".join(
        [tool.format_for_llm() for tool in all_tools]
    )

    # 询问 LLM（Claude） 应该使用哪些工具。
    system_message = (
        "You are a helpful assistant with access to these tools:\n\n"
        f"{tools_description}\n"
        "Choose the appropriate tool based on the user's question. "
        "If no tool is needed, reply directly.\n\n"
        "IMPORTANT: When you need to use a tool, you must ONLY respond with "
        "the exact JSON object format below, nothing else:\n"
        "{\n"
        '    "tool": "tool-name",\n'
        '    "arguments": {\n'
        '        "argument-name": "value"\n'
        "    }\n"
        "}\n\n"
        "After receiving a tool's response:\n"
        "1. Transform the raw data into a natural, conversational response\n"
        "2. Keep responses concise but informative\n"
        "3. Focus on the most relevant information\n"
        "4. Use appropriate context from the user's question\n"
        "5. Avoid simply repeating the raw data\n\n"
        "Please use only the tools that are explicitly defined above."
    )
    messages = [{"role": "system", "content": system_message}]

    while True:
        #追加用户消息输入.
        messages.append({"role": "user", "content": user_input})

        # 将 system_message（针对mcp调用的提示词） 和用户消息输入一起发送给 LLM
        llm_response = self.llm_client.get_response(messages)

    ... # 后面和确定使用哪些工具无关

class Tool:
    """Represents a tool with its properties and formatting."""

    def __init__(
        self, name: str, description: str, input_schema: dict[str, Any]
    ) -> None:
        self.name: str = name
        self.description: str = description
        self.input_schema: dict[str, Any] = input_schema

    # 把工具的名字 / 工具的用途（description）和工具所需要的参数（args_desc）转化为文本
    def format_for_llm(self) -> str:
        """Format tool information for LLM.
 
         Returns:
             A formatted string describing the tool.
         """
        args_desc = []

MCP的作用

借助 MCP，像 Claude 或 ChatGPT 这样的人工智能应用程序可以连接到数据源（例如本地文件、数据库）、工具（例如搜索引擎、计算器）和工作流程（例如特定提示），从而访问关键信息并执行任务。

开发者：MCP 可以减少构建 AI 应用程序或代理或与之集成时的开发时间和复杂性。

AI 应用或代理：MCP 提供对数据源、工具和应用程序生态系统的访问，这将增强功能并改善最终用户体验。

最终用户：MCP 可产生功能更强大的 AI 应用程序或代理，这些应用程序或代理可以访问您的数据并在必要时代表您采取行动。

说人话：可以将 MCP 想象成人工智能应用程序的 USB-C 接口。正如 USB-C 提供了一种连接电子设备的标准化方式一样，MCP 也提供了一种将人工智能应用程序连接到外部系统的标准化方式。

如何使用MCP

如果我们自己手动创建一个MCP server应该能够很好的理解其实市面上已经有大量的MCP server了）

下面我们具体来搭建一个简单的加法运算的MCP，越简单越好：

创建文件main.py，内容如下：

# server.py
from mcp.server.fastmcp import FastMCP
# Create an MCP server
mcp = FastMCP("Demo")
# Add an addition tool
@mcp.tool()
def add(a: int, b: int) -> int:
"""Add two numbers"""
return a + b
# Add a dynamic greeting resource
@mcp.resource("greeting://{name}")
def get_greeting(name: str) -> str:
"""Get a personalized greeting"""
return f"Hello, {name}!"
if __name__ == "__main__":
mcp.run(transport='sse')

启动服务