Updated 3 months ago

👋 Is there a way to change the default

👋 Is there a way to change the default templates in schema.py (specifically DEFAULT_TEXT_NODE_TMPL)? The template is used by the get_content method on the TextNode.

20 comments

Logan M

There is!

Logan M

https://docs.llamaindex.ai/en/stable/module_guides/loading/documents_and_nodes/usage_documents.html#summary

Logan M

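# Import path for llama-index 0.10 and later; older releases use `from llama_index import Document`.
from llama_index.core import Document

document = Document(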
    text="This is a super-customized document",
    metadata={
        "file_name": "super_secret_document.txt",
        "category": "finance",
        "author": "LlamaIndex",
    },
    excluded_llm_metadata_keys=["file_name"],
    metadata_seperator="::",
    metadata_template="{key}=>{value}",
    text_template="Metadata: {metadata_str}\n-----\nContent: {content}",
)
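
For anyone reading along, here is a rough sketch of how those settings seem to combine when the content is rendered for the LLM. `render_for_llm` is a hypothetical stand-in written in plain Python, not LlamaIndex's own code. It assumes that each metadata pair is formatted with `metadata_template`, that the pairs are joined with `metadata_seperator`, and that the result is dropped into `text_template`.

```python
def render_for_llm(text, metadata, excluded_keys, separator,
                   metadata_template, text_template):
    """Hypothetical stand-in for the rendering step (not the library's code)."""
    # Drop the keys that should stay hidden from the LLM.
    visible = {k: v for k, v in metadata.items() if k not in excluded_keys}
    # Format each remaining pair, then join the pairs with the separator.
    metadata_str = separator.join(
        metadata_template.format(key=k, value=str(v)) for k, v in visible.items()
    )
    # Assumption: with no visible metadata, the bare text is returned.
    if not metadata_str:
        return text
    return text_template.format(metadata_str=metadata_str, content=text)


rendered = render_for_llm(
    text="This is a super-customized document",
    metadata={
        "file_name": "super_secret_document.txt",
        "category": "finance",
        "author": "LlamaIndex",
    },
    excluded_keys=["file_name"],
    separator="::",
    metadata_template="{key}=>{value}",
    text_template="Metadata: {metadata_str}\n-----\nContent: {content}",
)
print(rendered)
# Metadata: category=>finance::author=>LlamaIndex
# -----
# Content: This is a super-customized document
```

With the real classes, calling `document.get_content(metadata_mode=MetadataMode.LLM)` is the way to check what actually gets rendered.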

PocketColin

perfect! But now wait, what's the difference between a document and a node? I thought I was working with nodes here

PocketColin

oh..... I think I see. So is this something I'd have to connect to SimpleDirectoryReader?

Logan M

there is nearly zero difference between a document and node

Logan M

mostly just naming/perception lol

Logan M

the classes are nearly identical

PocketColin

ohhhhh haha

PocketColin

so when I iterate through all of my nodes after pulling them out of a PDF, should I just instantiate new Nodes with all of this customization added? I'm guessing I could do:

node = TextNode(
  text="blah blah",
...
)

Logan M

Yea you can do that! Or you can just modify the existing nodes if you have them

Logan M

node.text_template = "..."
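
A minimal sketch of that in-place approach over a whole list of nodes. `ToyNode` is a hypothetical stand-in so the snippet runs without the library, and the default template shown is an assumption about `DEFAULT_TEXT_NODE_TMPL`. With real nodes only the loop at the bottom is needed.

```python
from dataclasses import dataclass, field


@dataclass
class ToyNode:
    """Hypothetical stand-in for a parsed node; only the fields used here."""
    text: str
    metadata: dict = field(default_factory=dict)
    # Assumed default; check DEFAULT_TEXT_NODE_TMPL in your installed schema.py.
    text_template: str = "{metadata_str}\n\n{content}"


# Pretend these came out of a PDF parser.
nodes = [ToyNode(text="page one"), ToyNode(text="page two")]

custom_template = "Metadata: {metadata_str}\n-----\nContent: {content}"
for node in nodes:
    # Overwrite the template on the existing object instead of rebuilding it.
    node.text_template = custom_template
```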

PocketColin

🤯

PocketColin

of course I can. How did I miss that! Thanks!

PocketColin

Ok haha followup question! VectorStoreIndex().build_index_from_nodes(nodes) returns an IndexDict type object but I really need the VectorStoreIndex. Should I just be passing nodes to .from_documents(nodes) instead?

PocketColin

Except no that doesn't work because .from_documents expects the list items to have .get_doc_id

Logan M

Use VectorStoreIndex(nodes, ...)

PocketColin

oh ok I thought I'd still need to call some sort of processing method but I guess not

PocketColin

thanks!

Logan M

build_index_from_nodes() is actually called from the base constructor 👍
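
A toy illustration of that pattern, for anyone wondering why calling `build_index_from_nodes()` directly hands back an internal structure while the constructor hands back the index. `ToyBaseIndex` and `ToyVectorIndex` are hypothetical classes, not LlamaIndex source. They only mimic the shape described above: the base constructor calls the build method and stores the result.

```python
class ToyBaseIndex:
    """Toy stand-in for a base index class (hypothetical, not library code)."""

    def __init__(self, nodes):
        # The base constructor builds the internal structure right away,
        # so callers never need to invoke the build method themselves.
        self.index_struct = self.build_index_from_nodes(nodes)

    def build_index_from_nodes(self, nodes):
        raise NotImplementedError


class ToyVectorIndex(ToyBaseIndex):
    def build_index_from_nodes(self, nodes):
        # Returns the bookkeeping structure, not the index object itself.
        return {f"node-{i}": text for i, text in enumerate(nodes)}


# Passing nodes to the constructor yields the index object directly.
index = ToyVectorIndex(["first chunk", "second chunk"])
print(type(index).__name__)  # ToyVectorIndex
print(index.index_struct)    # {'node-0': 'first chunk', 'node-1': 'second chunk'}
```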