Llama 3 Preferred RAG Prompting Format (xml tags vs. markdown vs. something else) #450
Comments
There's no mention of a preferred format for Llama 3. According to the Llama 3 model card, you just need to follow the new Llama 3 prompt format (also specified in HF's blog here), but if you use a framework like LangChain, a service provider like Groq/Replicate, or run Llama 3 locally using Ollama for your RAG apps, you most likely won't need to deal with the new prompt format directly, as it's hardcoded by them under the hood. Just use an appropriate RAG prompt (e.g. rag-prompt) with your question, context, and possibly chat history for Llama 3 to answer.
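For reference, a minimal sketch of assembling the Llama 3 instruct chat format by hand (the special tokens follow the model card; the way the context and question are laid out inside the user turn is a hypothetical RAG layout, not something the model card prescribes):

```python
def build_llama3_rag_prompt(context: str, question: str) -> str:
    """Wrap a RAG context/question pair in the Llama 3 instruct chat format.

    The special tokens (<|begin_of_text|>, header ids, <|eot_id|>) come from
    the Llama 3 model card; the system message and user-turn layout here are
    illustrative choices only.
    """
    system = "Answer the question using only the provided context."
    user = f"Context:\n{context}\n\nQuestion: {question}"
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = build_llama3_rag_prompt(
    "Llama 3 was released in April 2024.",
    "When was Llama 3 released?",
)
```

As noted above, frameworks like LangChain, Ollama, or Groq apply this template for you, so you would normally only build it manually when calling the model's raw completion endpoint.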
Thank you, @jeffxtang! I am aware of the new prompt format. I was asking more about model preferences regarding RAG-type prompts and longer input prompts. Example 1 - XML tags (aka the way Anthropic recommends prompting their models):
Example 2 (formatted as markdown):
Example 3 (special tokens, like for example - https://huggingface.co/jondurbin/airoboros-l2-c70b-3.1.2):
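To make the three candidates concrete, here is a hypothetical sketch of what each formatting style might look like for the same context/question pair (the placeholder text and the exact tag/heading choices are illustrative, not taken from any model's documentation):

```python
CONTEXT = "Llama 3 was released in April 2024."
QUESTION = "When was Llama 3 released?"

# Example 1: XML tags (Anthropic-style) -- context wrapped in named tags.
xml_prompt = f"<context>\n{CONTEXT}\n</context>\n\nQuestion: {QUESTION}"

# Example 2: markdown -- sections delimited by headings.
md_prompt = f"## Context\n{CONTEXT}\n\n## Question\n{QUESTION}"

# Example 3: custom special tokens (airoboros-style delimiters; assumed here
# purely for illustration of the "special tokens" category).
tok_prompt = (
    f"BEGININPUT\n{CONTEXT}\nENDINPUT\n"
    f"BEGININSTRUCTION\n{QUESTION}\nENDINSTRUCTION"
)
```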
Is there a format that Llama-3 Instruct models prefer?
I'm not aware of such a preference for Llama 3, but with some automated RAG evaluation frameworks (there are quite a few nice open-source ones) it should be easy to compare the results of the example in the different formats and see if there's any quality difference. @krumeto
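The comparison loop suggested above could be sketched roughly like this. Everything here is hypothetical scaffolding: `generate` stands in for your model call and `score_answer` for whatever metric your evaluation framework provides (e.g. an answer-correctness score):

```python
# Candidate prompt templates to compare (illustrative layouts only).
FORMATS = {
    "xml": "<context>\n{context}\n</context>\n\nQuestion: {question}",
    "markdown": "## Context\n{context}\n\n## Question\n{question}",
}

def compare_formats(eval_set, generate, score_answer):
    """Return the mean score per prompt format over an evaluation set.

    eval_set: list of dicts with "context", "question", "reference" keys.
    generate: callable(prompt) -> model answer (your LLM call).
    score_answer: callable(answer, reference) -> float (your eval metric).
    """
    results = {}
    for name, template in FORMATS.items():
        scores = []
        for ex in eval_set:
            prompt = template.format(context=ex["context"], question=ex["question"])
            answer = generate(prompt)
            scores.append(score_answer(answer, ex["reference"]))
        results[name] = sum(scores) / len(scores)
    return results
```

Running this over a representative question set would show whether any format gives a consistent quality edge for a given Llama 3 variant.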
Hi @krumeto, were you able to find out what works best with Llama-3?
Hey @trivikramak, no, not yet (apologies). |
Ask it and see what it says. Try some different stuff. Interesting question. |
🚀 The feature, motivation and pitch
Anthropic directly states that their models prefer context in longer prompts (as in typical RAG applications) to be wrapped in XML tags. Some claim OpenAI's models prefer markdown-style formatting (their docs mention both markdown and XML tags).
Does Llama 3 have a preferred format for longer prompts?
Thank you in advance!
Alternatives
No response
Additional context
No response