Multi-Document Agents (V1)
This guide walks you through setting up a multi-document agent over the LlamaIndex documentation.
This is an extension of the V0 multi-document agents, with the additional features:
- Reranking during document (tool) retrieval
- A query planning tool that the agent can use
We do this with the following architecture:
- Set up a "document agent" over each document: each document agent can do QA/summarization within its own document
- Set up a top-level agent over these document agents: do tool retrieval first, then use a chain-of-thought (CoT) over the set of retrieved tools to answer a question
If you're opening this notebook on Colab, you will probably need to install LlamaIndex 🦙.
In [ ]:
%pip install llama-index-core
%pip install llama-index-agent-openai
%pip install llama-index-readers-file
%pip install llama-index-postprocessor-cohere-rerank
%pip install llama-index-llms-openai
%pip install llama-index-embeddings-openai
%pip install unstructured[html]
In [ ]:
%load_ext autoreload
%autoreload 2
In [ ]:
domain = "docs.llamaindex.ai"
docs_url = "https://docs.llamaindex.ai/en/latest/"
!wget -e robots=off --recursive --no-clobber --page-requisites --html-extension --convert-links --restrict-file-names=windows --domains {domain} --no-parent {docs_url}
In [ ]:
from llama_index.readers.file import UnstructuredReader
reader = UnstructuredReader()
In [ ]:
from pathlib import Path
all_files_gen = Path("./docs.llamaindex.ai/").rglob("*")
all_files = [f.resolve() for f in all_files_gen]
In [ ]:
all_html_files = [f for f in all_files if f.suffix.lower() == ".html"]
In [ ]:
len(all_html_files)
Out[ ]:
1656
In [ ]:
useful_files = [
    x
    for x in all_html_files
    if "understanding" in str(x).split(".")[-2]
    or "examples" in str(x).split(".")[-2]
]
print(len(useful_files))
680
In [ ]:
from llama_index.core import Document

# TODO: set to higher value if you want more docs to be indexed
doc_limit = 100

docs = []
for idx, f in enumerate(useful_files):
    # note: `idx > doc_limit` keeps indices 0..100, i.e. 101 documents
    if idx > doc_limit:
        break
    print(f"Idx {idx}/{len(useful_files)}")
    loaded_docs = reader.load_data(file=f, split_documents=True)
    # concatenate the page's elements into a single Document per file
    loaded_doc = Document(
        text="\n\n".join([d.get_content() for d in loaded_docs]),
        metadata={"path": str(f)},
    )
    print(loaded_doc.metadata["path"])
    docs.append(loaded_doc)
In [ ]:
print(len(docs))
101
Define Global LLM + Embeddings
In [ ]:
import os
os.environ["OPENAI_API_KEY"] = "sk-..."
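If you'd rather not hardcode the key in the notebook, here is a minimal alternative sketch (assuming you may already export OPENAI_API_KEY in your shell; `getpass` is from the standard library):
In [ ]:
import os
from getpass import getpass

# Prompt for the key only if it isn't already set in the environment
if not os.environ.get("OPENAI_API_KEY"):
    os.environ["OPENAI_API_KEY"] = getpass("OpenAI API key: ")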
In [ ]:
from llama_index.llms.openai import OpenAI
from llama_index.embeddings.openai import OpenAIEmbedding
from llama_index.core import Settings

llm = OpenAI(model="gpt-4o")
Settings.llm = llm
Settings.embed_model = OpenAIEmbedding(
    model="text-embedding-3-small", embed_batch_size=256
)
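As an optional sanity check, you can confirm the global defaults took effect. The `.model` and `.model_name` attribute names below are assumptions about the current llama-index classes, not something used elsewhere in this notebook:
In [ ]:
# Optional: verify the globally configured models (attribute names assumed)
print(Settings.llm.model)  # expected: "gpt-4o"
print(Settings.embed_model.model_name)  # expected: "text-embedding-3-small"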
Building Multi-Document Agents
In this section we show you how to construct the multi-document agent. We first build a document agent for each document, and then define the top-level parent agent with an object index.
Build Document Agent for each Document
In this section we define a "document agent" for each document.
We define both a vector index (for semantic search) and a summary index (for summarization) for each document. The two query engines are then converted into tools that are passed to an OpenAI function-calling agent.
This document agent can dynamically choose to perform semantic search or summarization within a given document.
We create a separate document agent for each document.
In [ ]:
from llama_index.core.agent.workflow import FunctionAgent, ReActAgent
from llama_index.core import (
    load_index_from_storage,
    StorageContext,
    VectorStoreIndex,
)
from llama_index.core import SummaryIndex
from llama_index.core.tools import QueryEngineTool
from llama_index.core.node_parser import SentenceSplitter
import os
from tqdm.notebook import tqdm
import pickle


async def build_agent_per_doc(nodes, file_base):
    vi_out_path = f"./data/llamaindex_docs/{file_base}"
    summary_out_path = f"./data/llamaindex_docs/{file_base}_summary.pkl"
    if not os.path.exists(vi_out_path):
        Path("./data/llamaindex_docs/").mkdir(parents=True, exist_ok=True)
        # build vector index
        vector_index = VectorStoreIndex(nodes)
        vector_index.storage_context.persist(persist_dir=vi_out_path)
    else:
        vector_index = load_index_from_storage(
            StorageContext.from_defaults(persist_dir=vi_out_path),
        )

    # build summary index
    summary_index = SummaryIndex(nodes)

    # define query engines
    vector_query_engine = vector_index.as_query_engine(llm=llm)
    summary_query_engine = summary_index.as_query_engine(
        response_mode="tree_summarize", llm=llm
    )

    # extract a summary
    if not os.path.exists(summary_out_path):
        Path(summary_out_path).parent.mkdir(parents=True, exist_ok=True)
        summary = str(
            await summary_query_engine.aquery(
                "Extract a concise 1-2 line summary of this document"
            )
        )
        pickle.dump(summary, open(summary_out_path, "wb"))
    else:
        summary = pickle.load(open(summary_out_path, "rb"))

    # define tools
    query_engine_tools = [
        QueryEngineTool.from_defaults(
            query_engine=vector_query_engine,
            name=f"vector_tool_{file_base}",
            description="Useful for questions related to specific facts",
        ),
        QueryEngineTool.from_defaults(
            query_engine=summary_query_engine,
            name=f"summary_tool_{file_base}",
            description="Useful for summarization questions",
        ),
    ]

    # build agent
    function_llm = OpenAI(model="gpt-4")
    agent = FunctionAgent(
        tools=query_engine_tools,
        llm=function_llm,
        system_prompt=f"""\
You are a specialized agent designed to answer queries about the `{file_base}.html` part of the LlamaIndex docs.
You must ALWAYS use at least one of the tools provided when answering a question; do NOT rely on prior knowledge.\
""",
    )

    return agent, summary


async def build_agents(docs):
    node_parser = SentenceSplitter()

    # Build agents dictionary
    agents_dict = {}
    extra_info_dict = {}

    # # this is for the baseline
    # all_nodes = []

    for idx, doc in enumerate(tqdm(docs)):
        nodes = node_parser.get_nodes_from_documents([doc])
        # all_nodes.extend(nodes)

        # ID will be base + parent
        file_path = Path(doc.metadata["path"])
        file_base = str(file_path.parent.stem) + "_" + str(file_path.stem)
        agent, summary = await build_agent_per_doc(nodes, file_base)

        agents_dict[file_base] = agent
        extra_info_dict[file_base] = {"summary": summary, "nodes": nodes}

    return agents_dict, extra_info_dict
In [ ]:
agents_dict, extra_info_dict = await build_agents(docs)
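To get a feel for the output, you can peek at one of the generated summaries (a hypothetical inspection snippet; any key in `extra_info_dict` works):
In [ ]:
# Inspect an arbitrary per-document summary
example_key = next(iter(extra_info_dict))
print(example_key, "->", extra_info_dict[example_key]["summary"])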
Build Retriever-Enabled OpenAI Agent
We build a top-level agent that can orchestrate across the different document agents to answer any user query.
This agent will use a tool retriever to retrieve the most relevant tools for the query.
Improvements from V0: compared to the "base" version described in V0, we make the following improvements:
- Adding in reranking: we use a Cohere reranker to better filter the candidate set of documents.
- Adding a query planning tool: we add an explicit query planning tool that's dynamically created based on the set of retrieved tools.
In [ ]:
from typing import Callable
from llama_index.core.tools import FunctionTool


def get_agent_tool_callable(agent: FunctionAgent) -> Callable:
    async def query_agent(query: str) -> str:
        response = await agent.run(query)
        return str(response)

    return query_agent


# define tool for each document agent
all_tools = []
for file_base, agent in agents_dict.items():
    summary = extra_info_dict[file_base]["summary"]
    async_fn = get_agent_tool_callable(agent)
    doc_tool = FunctionTool.from_defaults(
        async_fn,
        name=f"tool_{file_base}",
        description=summary,
    )
    all_tools.append(doc_tool)
In [ ]:
print(all_tools[0].metadata)
ToolMetadata(description='The document provides a series of tutorials on building agentic LLM applications using LlamaIndex, covering key steps such as building RAG pipelines, agents, and workflows, along with techniques for data ingestion, indexing, querying, and application evaluation.', name='tool_understanding_index', fn_schema=<class 'llama_index.core.tools.utils.tool_understanding_index'>, return_direct=False)
In [ ]:
# define an "object" index and retriever over these tools
from llama_index.core import VectorStoreIndex
from llama_index.core.objects import (
    ObjectIndex,
    ObjectRetriever,
)
from llama_index.postprocessor.cohere_rerank import CohereRerank
from llama_index.core.query_engine import SubQuestionQueryEngine
from llama_index.core.schema import QueryBundle
from llama_index.llms.openai import OpenAI

llm = OpenAI(model="gpt-4o")

obj_index = ObjectIndex.from_objects(
    all_tools,
    index_cls=VectorStoreIndex,
)
vector_node_retriever = obj_index.as_node_retriever(
    similarity_top_k=10,
)


# define a custom object retriever that adds in a query planning tool
class CustomObjectRetriever(ObjectRetriever):
    def __init__(
        self,
        retriever,
        object_node_mapping,
        node_postprocessors=None,
        llm=None,
    ):
        self._retriever = retriever
        self._object_node_mapping = object_node_mapping
        self._llm = llm or OpenAI(model="gpt-4o")
        self._node_postprocessors = node_postprocessors or []

    def retrieve(self, query_bundle):
        if isinstance(query_bundle, str):
            query_bundle = QueryBundle(query_str=query_bundle)

        nodes = self._retriever.retrieve(query_bundle)
        # apply postprocessors (e.g. the Cohere reranker) to the retrieved nodes
        for processor in self._node_postprocessors:
            nodes = processor.postprocess_nodes(
                nodes, query_bundle=query_bundle
            )
        tools = [self._object_node_mapping.from_node(n.node) for n in nodes]

        # build a query planning tool on the fly over the retrieved tools
        sub_agent = FunctionAgent(
            name="compare_tool",
            description="""\
Useful for any queries that involve comparing multiple documents. ALWAYS use this tool for comparison queries - make sure to call this \
tool with the original query. Do NOT use the other tools for any queries involving multiple documents.
""",
            tools=tools,
            llm=self._llm,
            system_prompt="""You are an expert at comparing documents. Given a query, use the tools provided to compare the documents and return a summary of the results.""",
        )

        async def query_sub_agent(query: str) -> str:
            response = await sub_agent.run(query)
            return str(response)

        sub_question_tool = FunctionTool.from_defaults(
            query_sub_agent,
            name=sub_agent.name,
            description=sub_agent.description,
        )
        return tools + [sub_question_tool]
In [ ]:
# wrap it with ObjectRetriever to return objects
custom_obj_retriever = CustomObjectRetriever(
    vector_node_retriever,
    obj_index.object_node_mapping,
    node_postprocessors=[CohereRerank(top_n=5, model="rerank-v3.5")],
    llm=llm,
)
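Note that `CohereRerank` requires a Cohere API key. A minimal sketch, assuming the postprocessor falls back to the `COHERE_API_KEY` environment variable when no `api_key` argument is passed:
In [ ]:
import os

# Set the Cohere API key before the reranker makes its first call
os.environ["COHERE_API_KEY"] = "..."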
In [ ]:
tmps = custom_obj_retriever.retrieve("hello")
# should be 5 + 1 -- 5 from reranker, 1 from subquestion
print(len(tmps))
6
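You can also list the names of the retrieved tools to see what came back (this assumes each retrieved object exposes `metadata.name`, as `FunctionTool` does):
In [ ]:
# Print the retrieved tool names: 5 reranked doc tools + the compare tool
for tool in tmps:
    print(tool.metadata.name)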
In [ ]:
from llama_index.core.agent.workflow import ReActAgent, FunctionAgent

top_agent = FunctionAgent(
    tool_retriever=custom_obj_retriever,
    system_prompt=""" \
You are an agent designed to answer queries about the documentation.
Please always use the tools provided to answer a question. Do not rely on prior knowledge.\
""",
    llm=llm,
)

# top_agent = ReActAgent(
#     tool_retriever=custom_obj_retriever,
#     system_prompt=""" \
# You are an agent designed to answer queries about the documentation.
# Please always use the tools provided to answer a question. Do not rely on prior knowledge.\
# """,
#     llm=llm,
# )
In [ ]:
all_nodes = [
    n for extra_info in extra_info_dict.values() for n in extra_info["nodes"]
]
In [ ]:
base_index = VectorStoreIndex(all_nodes)
base_query_engine = base_index.as_query_engine(similarity_top_k=4)
Running Example Queries
Let's run some example queries, ranging from QA/summarization over a single document to QA/summarization over multiple documents.
In [ ]:
from llama_index.core.agent.workflow import (
    AgentStream,
    ToolCall,
    ToolCallResult,
)

handler = top_agent.run(
    "What can you build with LlamaIndex?",
)

async for ev in handler.stream_events():
    if isinstance(ev, ToolCallResult):
        print(
            f"\nCalling tool {ev.tool_name} with args {ev.tool_kwargs}\n Got response: {str(ev.tool_output)[:200]}"
        )
    elif isinstance(ev, ToolCall):
        print(f"\nTool call: {ev.tool_name} with args {ev.tool_kwargs}")
    # Print the stream of the agent
    # elif isinstance(ev, AgentStream):
    #     print(ev.delta, end="", flush=True)

response = await handler
Tool call: tool_SimpleIndexDemoLlama2_index with args {'query': 'What can you build with LlamaIndex?'}
Tool call: tool_apps_index with args {'query': 'What can you build with LlamaIndex?'}
Tool call: tool_putting_it_all_together_index with args {'query': 'What can you build with LlamaIndex?'}
Tool call: tool_llamacloud_index with args {'query': 'What can you build with LlamaIndex?'}
Calling tool tool_SimpleIndexDemoLlama2_index with args {'query': 'What can you build with LlamaIndex?'}
Got response: With LlamaIndex, you can build a VectorStoreIndex. This involves setting up the necessary environment, loading documents into the index, and then querying the index for information. You need to instal
Tool call: tool_using_llms_index with args {'query': 'What can you build with LlamaIndex?'}
Calling tool tool_llamacloud_index with args {'query': 'What can you build with LlamaIndex?'}
Got response: With LlamaIndex, you can build a system that connects to your data stores, automatically indexes them, and then queries the data. This is done by integrating LlamaCloud into your project. The system a
Calling tool tool_apps_index with args {'query': 'What can you build with LlamaIndex?'}
Got response: With LlamaIndex, you can build a full-stack web application. You can integrate it into a backend server like Flask, package it into a Docker container, or use it directly in a framework such as Stream
Calling tool tool_putting_it_all_together_index with args {'query': 'What can you build with LlamaIndex?'}
Got response: With LlamaIndex, you can build a variety of applications and tools. This includes:
1. Chatbots: You can use LlamaIndex to create interactive chatbots.
2. Agents: LlamaIndex can be used to build intel
Calling tool tool_using_llms_index with args {'query': 'What can you build with LlamaIndex?'}
Got response: With LlamaIndex, you can build a variety of applications by leveraging the various Language Model (LLM) integrations it supports. These include OpenAI, Anthropic, Mistral, DeepSeek, Hugging Face, and
In [ ]:
# print the final response string
print(str(response))
With LlamaIndex, you can build various applications and tools, including: 1. **VectorStoreIndex**: Set up and query a VectorStoreIndex by loading documents and configuring the environment as per the documentation. 2. **Full-Stack Web Applications**: Integrate LlamaIndex into backend servers like Flask, Docker containers, or frameworks like Streamlit. Resources include guides for TypeScript+React, Delphic starter template, and Flask, Streamlit, and Docker integration examples. 3. **Chatbots, Agents, and Unified Query Framework**: Create interactive chatbots, intelligent agents, and a unified query framework for handling different query types. LlamaIndex also supports property graphs and full-stack web applications. 4. **Data Management with LlamaCloud**: Build systems that connect to data stores, automatically index data, and efficiently query it by integrating LlamaCloud into your project. 5. **LLM Integrations**: Utilize various Language Model (LLM) integrations such as OpenAI, Anthropic, Mistral, DeepSeek, and Hugging Face. LlamaIndex provides a unified interface to access different LLMs, enabling you to select models based on their strengths and price points. You can use multi-modal LLMs for chat messages with text, images, and audio inputs, and even call tools and functions directly through API calls. These capabilities make LlamaIndex a versatile tool for building a wide range of applications and systems.
In [ ]:
# access the tool calls
# print(response.tool_calls)
In [ ]:
# baseline
response = base_query_engine.query(
    "What can you build with LlamaIndex?",
)
print(str(response))
With LlamaIndex, you can build a variety of applications and systems, including a full-stack web application, a chatbot, and a unified query framework over multiple indexes. You can also perform semantic searches, summarization queries, and queries over structured data like SQL or Pandas DataFrames. Additionally, LlamaIndex supports routing over heterogeneous data sources and compare/contrast queries. It provides tools and templates to help you integrate these capabilities into production-ready applications.
In [ ]:
response = await top_agent.run("Compare workflows to query engines")
print(str(response))
Workflows and query engines serve different purposes in an application context: 1. Workflows: - Workflows are designed to manage the execution flow of an application by dividing it into sections triggered by events. - They are event-driven and step-based, allowing for the management of application complexity by breaking it into smaller, more manageable pieces. - Workflows focus on controlling the flow of application execution through steps and events. 2. Query Engines: - Query engines are tools used to process queries against a database or data source to retrieve specific information. - They are primarily used for querying and retrieving data from databases. - Query engines are focused on the retrieval, postprocessing, and response synthesis stages of querying. In summary, workflows are more about controlling the flow of application execution, while query engines are specifically designed for querying and retrieving data from databases.
In [ ]:
response = await top_agent.run(
    "Can you compare the compact and tree_summarize response synthesizer response modes at a very high-level?"
)
print(str(response))
The compact response synthesizer mode aims to produce concise and condensed responses, focusing on delivering the most relevant information in a brief format. On the other hand, the tree_summarize response synthesizer mode is designed to create structured and summarized responses, organizing information in a comprehensive manner. In summary, the compact mode provides brief and straightforward responses, while the tree_summarize mode offers more detailed and organized output for a comprehensive summary.