Replies: 3 comments
-
🤖 Hello, Based on the similar issues you've found, it seems you can use a custom model with `VectorStoreIndex.from_documents`. Here's an example of how you might do this:

```python
from llama_index import VectorStoreIndex, ServiceContext, Document
from llama_index.llms import HuggingFaceLLM
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load your custom model and tokenizer
tokenizer = AutoTokenizer.from_pretrained("path/to/your/tokenizer")
model = AutoModelForCausalLM.from_pretrained("path/to/your/model")

# from_documents does not accept a model/tokenizer directly; wrap them in a
# HuggingFaceLLM and hand that to the service context instead
llm = HuggingFaceLLM(model=model, tokenizer=tokenizer)
# embed_model="local" keeps embeddings local instead of falling back to OpenAI
service_context = ServiceContext.from_defaults(llm=llm, embed_model="local")

# from_documents expects Document objects rather than raw strings
documents = [Document(text=t) for t in ("doc1", "doc2", "doc3")]
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
```

In this example, replace `"path/to/your/tokenizer"` and `"path/to/your/model"` with the actual paths to your tokenizer and model. If you're unable to load your model and tokenizer outside of the huggingface LLM, you might need to implement a custom LLM class. You can find more information on how to do this in the LlamaIndex documentation. I hope this helps! Let me know if you have any other questions.
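To make that last suggestion concrete, here is a minimal sketch of a custom LLM class in the style of the LlamaIndex docs. The class name `OurLLM` and the placeholder completion text are hypothetical; you would call your own model inside `complete()`:

```python
from typing import Any
from llama_index.llms import (
    CustomLLM,
    CompletionResponse,
    CompletionResponseGen,
    LLMMetadata,
)
from llama_index.llms.base import llm_completion_callback

class OurLLM(CustomLLM):  # hypothetical name
    context_window: int = 3900
    num_output: int = 256
    model_name: str = "custom"

    @property
    def metadata(self) -> LLMMetadata:
        # Advertise the model's limits to the rest of LlamaIndex
        return LLMMetadata(
            context_window=self.context_window,
            num_output=self.num_output,
            model_name=self.model_name,
        )

    @llm_completion_callback()
    def complete(self, prompt: str, **kwargs: Any) -> CompletionResponse:
        # Call your own model here and return its generated text
        return CompletionResponse(text="...")

    @llm_completion_callback()
    def stream_complete(self, prompt: str, **kwargs: Any) -> CompletionResponseGen:
        # Streaming is optional; raise if your model does not support it
        raise NotImplementedError()
```

An instance of this class can then be passed as `llm=` to `ServiceContext.from_defaults` just like the built-in LLMs.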
This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. If you want to continue the conversation, start your reply with @dosu-bot.
-
What if I want to use the LlamaCPP object as the model for VectorStoreIndex.from_documents?
-
@barvhaim Once you have the llm object, just throw it into the service context (NOTE: make sure you have llama-cpp-python <= 0.1.78 installed if using GGML files; after that version they switched to GGUF).
Also important are the `messages_to_prompt` and `completion_to_prompt` functions. Most models need a specific input format, and those functions convert it. Right now, the ones in that example are specific to llama-2-chat.
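For concreteness, here is a minimal sketch of that wiring, following the llama_2_llama_cpp example linked in the question; the model path is a placeholder, and `embed_model="local"` is an assumption here so embeddings don't silently fall back to OpenAI:

```python
from llama_index import VectorStoreIndex, ServiceContext, SimpleDirectoryReader
from llama_index.llms import LlamaCPP
from llama_index.llms.llama_utils import messages_to_prompt, completion_to_prompt

# Point model_path at your local GGML file (placeholder path)
llm = LlamaCPP(
    model_path="path/to/llama-2-13b-chat.ggmlv3.q4_0.bin",
    temperature=0.1,
    max_new_tokens=256,
    context_window=3900,
    # These two convert chat messages / completions into llama-2-chat's
    # expected prompt format; swap them out for other model families
    messages_to_prompt=messages_to_prompt,
    completion_to_prompt=completion_to_prompt,
    verbose=True,
)

# Throw the llm into the service context, then pass that to from_documents
service_context = ServiceContext.from_defaults(llm=llm, embed_model="local")
documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
```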
-
Question Validation
Question
When I'm trying to use llama-index `VectorStoreIndex.from_documents` without OpenAI, it will download and use a default llama-2 model (llama-2-13b-chat.ggmlv3.q4_0.bin), as mentioned in https://gpt-index.readthedocs.io/en/stable/examples/llm/llama_2_llama_cpp.html. I managed to load a different model by constructing a `LlamaCPP` object as mentioned in the doc, but how can I use it with `VectorStoreIndex.from_documents`? It was not clear from the documentation.