Nebius LLMs¶
This notebook demonstrates how to use LLMs hosted on Nebius AI Studio with LlamaIndex. Nebius AI Studio provides state-of-the-art LLMs that are available for commercial use.
First, install the dependencies for LlamaIndex and Nebius AI Studio.
In [ ]:
%pip install llama-index-llms-nebius llama-index
Load your Nebius AI Studio key from an environment variable below, or paste it in directly. You can get one by signing up for Nebius AI Studio for free and generating a key on the API key management page.
In [ ]:
import os

NEBIUS_API_KEY = os.getenv("NEBIUS_API_KEY")  # NEBIUS_API_KEY = ""
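A missing key will otherwise surface only later as an opaque authentication error from the API. A minimal stdlib sketch of a fail-fast check (the helper name `get_nebius_api_key` is ours, not part of the SDK):

```python
import os


def get_nebius_api_key() -> str:
    # Hypothetical helper: fail fast with a clear message when the key
    # is missing, instead of hitting an auth error at request time.
    key = os.getenv("NEBIUS_API_KEY")
    if not key:
        raise RuntimeError(
            "NEBIUS_API_KEY is not set; generate one on the Nebius AI Studio "
            "API key management page and export it in your shell."
        )
    return key
```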
In [ ]:
from llama_index.llms.nebius import NebiusLLM

llm = NebiusLLM(
    api_key=NEBIUS_API_KEY, model="meta-llama/Llama-3.3-70B-Instruct-fast"
)
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
Call complete with a prompt¶
In [ ]:
response = llm.complete("Amsterdam is the capital of ")
print(response)
The Netherlands! Amsterdam is indeed the capital and largest city of the Netherlands.
Call chat with a list of messages¶
In [ ]:
from llama_index.core.llms import ChatMessage

messages = [
    ChatMessage(role="system", content="You are a helpful AI assistant."),
    ChatMessage(
        role="user",
        content="Answer briefly: who is Wall-e?",
    ),
]
response = llm.chat(messages)
print(response)
assistant: WALL-E is a small waste-collecting robot and the main character in the 2008 Pixar animated film of the same name.
Streaming¶
Using stream_complete endpoint¶
In [ ]:
response = llm.stream_complete("Amsterdam is the capital of ")
for r in response:
    print(r.delta, end="")
The Netherlands! Amsterdam is indeed the capital and largest city of the Netherlands.
Using stream_chat with a list of messages¶
In [ ]:
from llama_index.core.llms import ChatMessage

messages = [
    ChatMessage(role="system", content="You are a helpful AI assistant."),
    ChatMessage(
        role="user",
        content="Answer briefly: who is Wall-e?",
    ),
]
response = llm.stream_chat(messages)
for r in response:
    print(r.delta, end="")
WALL-E is a small waste-collecting robot and the main character in the 2008 Pixar animated film of the same name.
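In both streaming endpoints, each yielded chunk's `delta` attribute carries only the newly generated text, so joining the deltas reproduces the full reply. A minimal stdlib sketch of that accumulation (the `Chunk` dataclass here is a stand-in for LlamaIndex's streaming response objects, not a real SDK type):

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    # Stand-in for a streamed response object: .delta holds only the new text.
    delta: str


def collect_stream(chunks) -> str:
    # Join the incremental deltas into the complete response text,
    # mirroring the print(r.delta, end="") loop above.
    return "".join(chunk.delta for chunk in chunks)


# Simulated stream of three chunks
stream = [Chunk("WALL-E is "), Chunk("a small "), Chunk("robot.")]
print(collect_stream(stream))  # WALL-E is a small robot.
```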