LiteLLM Deployment Guide

Author: L
Published: July 3, 2026
Category: Deployment guides

Prerequisites

The system is openEuler 22.03 on ARM (x86 also works, but you need to change the image addresses yourself):

cat /etc/os-release
NAME="openEuler"
VERSION="22.03 (LTS-SP4)"
ID="openEuler"
VERSION_ID="22.03"
PRETTY_NAME="openEuler 22.03 (LTS-SP4)"
ANSI_COLOR="0;31"

Docker and Docker Compose are installed:

Docker version 29.5.3, build d1c06ef 
Docker Compose version v5.1.4

The LiteLLM version is v1.89.1.
vLLM is already deployed and answers curl requests. Sample test requests:

chat

curl http://127.0.0.1:8000/v1/chat/completions -H "Content-Type: application/json" -d '{"model":"qwen3","messages":[{"role":"user","content":"你好，请简单介绍下自己"}],"temperature":0.7,"max_tokens":1024}'

embedding

curl http://127.0.0.1:9010/v1/embeddings -H "Content-Type: application/json" -d '{"model":"/model","input":"测试文本向量化"}'

rerank

curl http://127.0.0.1:9011/v1/score -H "Content-Type: application/json" -d '{"model":"/model","pairs":[["人工智能是什么","人工智能是模拟人类智能的技术"],["人工智能是什么","今天下雨了"]]}'

I. Directory structure

litellm/
├── .env
├── docker-compose.yml
└── prometheus.yml

II. Create the configuration files

mkdir litellm && cd litellm

In .env, LITELLM_MASTER_KEY is the default login password; SEARXNG_API_BASE is optional.

cat > .env << 'EOF'
LITELLM_MASTER_KEY="sk-c0oE4EgitT4DmXyRj4Av"
SEARXNG_API_BASE="http://192.168.105.143:18080"
EOF
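Rather than reusing the sample key above, a fresh value can be generated. This is a minimal sketch: the helper names gen_master_key and render_env are hypothetical and not part of LiteLLM, and the only thing carried over from the sample is the sk- prefix.

```shell
# Hypothetical helper: print a random key in the "sk-" style used above.
# Reads 16 random bytes from /dev/urandom and hex-encodes them (32 hex chars).
gen_master_key() {
  printf 'sk-%s\n' "$(head -c 16 /dev/urandom | od -An -tx1 | tr -d ' \n')"
}

# Hypothetical helper: print .env content with a fresh key.
# SEARXNG_API_BASE is optional and is left out of this sketch.
render_env() {
  printf 'LITELLM_MASTER_KEY="%s"\n' "$(gen_master_key)"
}
```

Running `render_env > .env` would then write the file; add SEARXNG_API_BASE by hand if you need it.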

cat > docker-compose.yml << 'EOF'
services:
  litellm:
    build:
      context: .
      args:
        target: runtime
    image: swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/docker.litellm.ai/berriai/litellm:main-stable-linuxarm64
    #########################################
    ## Uncomment these lines to start proxy with a config.yaml file ##
    # volumes:
    #  - ./config.yaml:/app/config.yaml
    # command:
    #  - "--config=/app/config.yaml"
    ##############################################
    ports:
      - "3000:4000" # Map the container port to the host, change the host port if necessary
    environment:
      DATABASE_URL: "postgresql://llmproxy:dbpassword9090@db:5432/litellm"
      # Optional: route read-only queries (find_*, count, group_by, query_raw/_first)
      # to a separate reader endpoint, e.g. an Aurora reader. Leave unset for
      # single-DB deployments. With IAM_TOKEN_DB_AUTH enabled, the reader URL
      # is auto-refreshed alongside the writer.
      # DATABASE_URL_READ_REPLICA: "postgresql://llmproxy:dbpassword9090@db-reader:5432/litellm"
      STORE_MODEL_IN_DB: "True" # allows adding models to proxy via UI
    env_file:
      - .env # Load local .env file
    depends_on:
      - db  # Indicates that this service depends on the 'db' service, ensuring 'db' starts first
    healthcheck:  # Defines the health check configuration for the container
      test:
        - CMD-SHELL
        - python3 -c "import urllib.request; urllib.request.urlopen('http://localhost:4000/health/liveliness')"  # Command to execute for health check
      interval: 30s  # Perform health check every 30 seconds
      timeout: 10s   # Health check command times out after 10 seconds
      retries: 3     # Retry up to 3 times if health check fails
      start_period: 40s  # Wait 40 seconds after container start before beginning health checks

  db:
    image: swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/postgres:16-linuxarm64
    restart: always
    container_name: litellm_db
    environment:
      POSTGRES_DB: litellm
      POSTGRES_USER: llmproxy
      POSTGRES_PASSWORD: dbpassword9090
    ports:
      - "5432:5432"
    volumes:
      - postgres_data:/var/lib/postgresql/data # Persists Postgres data across container restarts
    healthcheck:
      test: ["CMD-SHELL", "pg_isready -d litellm -U llmproxy"]
      interval: 1s
      timeout: 5s
      retries: 10

  prometheus:
    image: swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/prom/prometheus:v3.5.4-linuxarm64
    volumes:
      - prometheus_data:/prometheus
      - ./prometheus.yml:/etc/prometheus/prometheus.yml
    ports:
      - "9090:9090"
    command:
      - "--config.file=/etc/prometheus/prometheus.yml"
      - "--storage.tsdb.path=/prometheus"
      - "--storage.tsdb.retention.time=15d"
    restart: always

volumes:
  prometheus_data:
    driver: local
  postgres_data:
    name: litellm_postgres_data # Named volume for Postgres data persistence
EOF

cat > prometheus.yml << 'EOF'
global:
  scrape_interval: 15s

scrape_configs:
  - job_name: 'litellm'
    static_configs:
      - targets: ['litellm:4000']  # Assuming Litellm exposes metrics at port 4000

EOF

III. Start LiteLLM

docker compose up -d

Once the containers are up, open http://localhost:3000/ui to reach the admin web UI.
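The proxy needs some time after `docker compose up -d` before it responds. The compose healthcheck probes /health/liveliness on container port 4000, which the port mapping exposes on host port 3000. A small polling loop can wait for that endpoint from the host; the function name wait_for_url and its default values are assumptions made for this sketch.

```shell
# Hypothetical helper: poll a URL until it answers or the attempts run out.
# $1 = URL, $2 = max attempts (default 30), $3 = seconds between attempts (default 2)
wait_for_url() {
  url=$1
  attempts=${2:-30}
  delay=${3:-2}
  i=1
  while [ "$i" -le "$attempts" ]; do
    # -f: treat HTTP errors (>= 400) as failure; -s: quiet; -o /dev/null: drop the body
    if curl -fs -o /dev/null "$url"; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}
```

For example, `wait_for_url http://localhost:3000/health/liveliness && echo "LiteLLM is up"` returns as soon as the proxy responds, or fails after roughly a minute with the defaults.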

IV. Configure the web UI

1. Configure the model endpoint

Models + Endpoints ---> Add Model

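Fill in the form as follows:

Field	Value	Description
Provider	vllm	Upstream model provider type
LiteLLM Model Name(s)	qwen3	Upstream model name; must match the model name configured when starting the vLLM Docker container
Model Mappings->Public Model Name	qwen3-public	Model name exposed to downstream platforms
Model Mappings->LiteLLM Model Name	Default	Filled in automatically once the field above is set; normally identical to the LiteLLM model name entered above
Mode	Chat - /chat/completions	Choose chat for chat models; for embedding and rerank models, pick the mode that matches the model type
Existing Credentials	Leave empty
API Base	http://127.0.0.1:8000/v1	API address, including the /v1 path. 127.0.0.1 is resolved from inside the LiteLLM container, so use an address the container can reach if vLLM listens elsewhere
API Key	Leave empty	Not needed if vLLM has no key configured

After filling in the form, click Test Connect to check the connection, and save once it succeeds.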
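The same model can also be declared in a config.yaml file instead of the UI, which is what the commented-out volumes and command lines in docker-compose.yml are for. The following is a sketch rather than the method used in this guide: it assumes LiteLLM's hosted_vllm/ provider prefix for an OpenAI-compatible vLLM server and reuses the names and address from the table above.

```shell
# Sketch: write a config.yaml that declares the same model as the UI form.
cat > config.yaml << 'EOF'
model_list:
  - model_name: qwen3-public              # public name exposed to downstream clients
    litellm_params:
      model: hosted_vllm/qwen3            # provider prefix + upstream model name
      api_base: http://127.0.0.1:8000/v1  # value from the table; must be reachable from the LiteLLM container
EOF
```

To load the file, uncomment the volumes and command entries of the litellm service in docker-compose.yml and recreate the container.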

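2. Configure a team

Teams ---> Create Team

Fill in the form as follows:

Field	Value	Description
Team Name	test-team	Team name of your choice
Models	qwen3-public	Select the public model name configured above

Leave all other fields empty or at their defaults.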

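3. Configure a user

Internal Users ---> Invite User

Fill in the form as follows:

Field	Value	Description
User Email	test-person	User name; it does not have to be an email address, and Chinese characters are supported
Global Proxy Role	Internal User (View Only)	Optional; the default here is a user who can only use models and has no admin privileges
Team	test-team	Team name; select the team created above

Leave all other fields empty or at their defaults.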

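4. Configure an API key

Virtual Keys ---> Create New Key

Fill in the form as follows:

Field	Value	Description
Owned By	Another User	The key is issued to a user; other options cover, for example, a service or an agent
User ID	test-person	The user created above
Organization	Leave empty
Team	test-team	The team created above
Key Name	test-key	Key name shown in logs and usage
Models	All team Models	Models the key may use; choose all available models or a single model. With a single model, quotas can be set per user and per model
Key Type	AI APIs	Key permissions

Leave all other fields empty or at their defaults.

After saving, a dialog displays the key. Note that this is the only chance to copy it; afterwards it is shown only in masked form.

My key here is sk-mHz0miMx_QyR6xAhxVmSgw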

V. Test connectivity

Test with the following curl command, replacing the key and the model field with the values from above:

curl http://localhost:3000/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer sk-mHz0miMx_QyR6xAhxVmSgw" -d '{"model": "qwen3-public", "messages": [{"role": "user", "content": "你好，请介绍一下你自己"}]}'

If everything works, the result looks like this:

[root@localhost gongwen]# curl http://localhost:3000/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer sk-mHz0miMx_QyR6xAhxVmSgw" -d '{"model": "qwen3-public", "messages": [{"role": "user", "content": "你好，请介绍一下你自己"}]}'
{"id":"chatcmpl-bfbabeb7f39ecddb","created":1782720274,"model":"qwen3-public","object":"chat.completion","choices":[{"finish_reason":"stop","index":0,"message":{"content":"你好！我是通义千问（Qwen），是阿里巴巴集团旗下的通义实验室研发的超大规模语言模型。我能够回答问题、创作文字，比如写故事、公文、邮件、剧本，进行逻辑推理、编程，甚至能玩游戏、表达观点和聊天。\n\n我的能力包括但不限于：\n\n- 多语言支持：我支持包括中文、英文、法语、西班牙语、葡萄牙语、俄语、阿拉伯语、日语、韩语、越南语、泰语、印尼语等在内的数十种语言。\n- 代码写作：我熟悉多种编程语言，可以帮你写代码、调试、解释算法。\n- 逻辑推理：我能处理复杂的逻辑问题，比如数学题、谜题、推理题等。\n- 长文本处理：我支持超长上下文（最高可达32768个token），适合处理长文档、书籍摘要、会议记录等。\n- 对话理解：我擅长多轮对话，能记住上下文，提供连贯、自然的交流体验。\n- 知识丰富：我训练数据截至2024年，涵盖广泛领域，能为你提供最新、最全面的信息（在训练数据范围内）。\n\n我叫“通义千问”，“通义”代表我具有广泛的知识和普适性，“千问”则代表我能够回答各种各样的问题，无论多么复杂或独特。\n\n如果你有任何问题或需要帮助，尽管告诉我，我会尽力为你提供支持！😊\n\n—— 你的AI助手 Qwen","role":"assistant","provider_specific_fields":{"refusal":null,"reasoning":null}},"provider_specific_fields":{"token_ids":null,"stop_reason":null}}],"usage":{"completion_tokens":318,"prompt_tokens":12,"total_tokens":330}}[root@localhost gongwen]#
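For repeated tests it can be handy to keep the key out of the command line and read it from an environment variable. The wrapper below is a sketch under stated assumptions: the function names json_escape and chat_request and the variables LITELLM_API_KEY and LITELLM_BASE_URL are hypothetical names chosen for this example, and the escaping covers only backslashes and double quotes.

```shell
# Hypothetical helper: escape backslashes and double quotes for a JSON string.
# Newlines and other control characters are not handled in this sketch.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Hypothetical helper: send one user message through the LiteLLM proxy.
# $1 = public model name, $2 = message text
# LITELLM_API_KEY must hold the virtual key; LITELLM_BASE_URL defaults to the
# address used in this guide.
chat_request() {
  base=${LITELLM_BASE_URL:-http://localhost:3000}
  body=$(printf '{"model": "%s", "messages": [{"role": "user", "content": "%s"}]}' \
    "$(json_escape "$1")" "$(json_escape "$2")")
  curl -s "$base/v1/chat/completions" \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $LITELLM_API_KEY" \
    -d "$body"
}
```

After `export LITELLM_API_KEY=<your virtual key>`, a call such as `chat_request qwen3-public "你好，请介绍一下你自己"` sends the same request as the curl command above.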

Last modified: July 3, 2026


