Context Adherence Luna

Understand Galileo's Context Adherence Luna Metric

Definition: Measures whether your model's response was purely based on the context provided.

Calculation: Context Adherence Luna is computed using a fine-tuned in-house Galileo evaluation model. The model is a transformer-based encoder that predicts the probability of Context Adherence for an input response and context. The model is trained on carefully curated RAG datasets and optimized to mimic the Context Adherence Plus metric.

The same model is used to compute Chunk Adherence, Chunk Completeness, Chunk Attribution, and Chunk Utilization; a single inference call computes all of the Luna metrics at once.
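The sketch below is not Galileo's implementation; it only illustrates the general shape of this design: one shared encoder scoring a (context, response) pair, with a separate head per metric so that a single forward pass yields every score. The encoder checkpoint, head layout, and metric names in the code are illustrative assumptions, and the untrained heads here produce meaningless probabilities.

```python
# Illustrative sketch only: a shared encoder with one linear head per metric,
# so all scores come from a single inference call. Not the Luna model itself.
import torch
from transformers import AutoTokenizer, AutoModel

MODEL_NAME = "bert-base-uncased"  # placeholder encoder, not the Luna checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
encoder = AutoModel.from_pretrained(MODEL_NAME)

# One head per Luna-style metric, all sharing the same encoder.
metric_heads = torch.nn.ModuleDict({
    "context_adherence": torch.nn.Linear(encoder.config.hidden_size, 1),
    "chunk_adherence": torch.nn.Linear(encoder.config.hidden_size, 1),
    "chunk_completeness": torch.nn.Linear(encoder.config.hidden_size, 1),
    "chunk_attribution_utilization": torch.nn.Linear(encoder.config.hidden_size, 1),
})

def score(context: str, response: str) -> dict:
    """Encode the (context, response) pair once and return a probability per metric."""
    inputs = tokenizer(context, response, truncation=True, return_tensors="pt")
    with torch.no_grad():
        cls = encoder(**inputs).last_hidden_state[:, 0]  # [CLS] representation
        return {name: torch.sigmoid(head(cls)).item()
                for name, head in metric_heads.items()}

# The heads above are randomly initialized, so these numbers are placeholders.
print(score("The Eiffel Tower is 330 m tall.", "The tower is 330 meters tall."))
```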

Explainability

The Luna model identifies which parts of the response are not adhering to the context provided. These sections can be highlighted in the response by clicking on the 👁️ icon next to the Context Adherence metric value in LLM or Chat nodes.

What to Do When Context Adherence Is Low?

When a response is highly adherent to the context (i.e., it has a value of 1 or close to 1), it strictly includes information from the provided context. However, when a response is not adherent (i.e., it has a value of 0 or close to 0), it likely contains facts not present in the given context.
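In practice, the score can be used to triage responses. The snippet below is a hypothetical post-processing step: the record structure and the 0.5 cutoff are illustrative assumptions, not part of Galileo's API.

```python
# Hypothetical triage step: flag responses whose Context Adherence Luna score
# falls below a review threshold. Field names and threshold are assumptions.
ADHERENCE_THRESHOLD = 0.5

records = [
    {"response": "Paris is the capital of France.", "context_adherence": 0.97},
    {"response": "The company was founded in 1999.", "context_adherence": 0.12},
]

needs_review = [r for r in records if r["context_adherence"] < ADHERENCE_THRESHOLD]
for r in needs_review:
    print(f"Low adherence ({r['context_adherence']:.2f}): {r['response']}")
```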

Several factors can contribute to low context adherence:

  1. Insufficient Context: If the source documents lack key information needed to answer the user's question, the response may be incomplete or off-topic. To address this, consider context enrichment strategies so that the retrieved source documents contain the information needed to answer the user's question (see the retrieval sketch after this list).

  2. Lack of Internal Reasoning and Creativity: While Retrieval-Augmented Generation (RAG) focuses on factual grounding, it doesn't directly enhance the internal reasoning processes of the LLM. This limitation can cause the model to struggle with logic or common-sense reasoning, potentially resulting in nonsensical outputs even if the facts are accurate.

  3. Lack of Contextual Awareness: Although RAG provides factual grounding for the language model, it might not fully understand the nuances of the prompt or user intent. This can lead to the model incorporating irrelevant information or missing key points, thus affecting the overall quality of the response.
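The sketch below illustrates two common enrichment strategies referenced in item 1: widening the number of retrieved chunks and expanding the user query before retrieval. The `retrieve` and `enriched_context` helpers, the embedding callback, and the expansion wording are hypothetical stand-ins for your own retriever and vector store, not a Galileo API.

```python
# Hypothetical context-enrichment sketch: wider top_k plus simple query expansion.
import math
from typing import Callable, List, Sequence, Tuple

def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def retrieve(query: str,
             chunks: List[Tuple[str, Sequence[float]]],
             embed: Callable[[str], Sequence[float]],
             top_k: int = 3) -> List[str]:
    """Return the top_k chunk texts most similar to the query embedding."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, c[1]), reverse=True)
    return [text for text, _ in ranked[:top_k]]

def enriched_context(query: str,
                     chunks: List[Tuple[str, Sequence[float]]],
                     embed: Callable[[str], Sequence[float]],
                     top_k: int = 6) -> List[str]:
    """Widen top_k and merge results for the original and an expanded query."""
    expanded = f"{query} (include definitions, dates, and figures)"
    seen, merged = set(), []
    for q in (query, expanded):
        for chunk in retrieve(q, chunks, embed, top_k=top_k):
            if chunk not in seen:
                seen.add(chunk)
                merged.append(chunk)
    return merged
```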

To improve context adherence, we recommend:

  1. Ensuring your context database contains all the information needed to answer the question.

  2. Adjusting the prompt to instruct the model to rely only on the information provided in the context (see the prompt sketch below).
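The template below is one hypothetical way to phrase such an instruction; the exact wording, the fallback answer, and the `build_prompt` helper are illustrative assumptions rather than a recommended Galileo prompt.

```python
# Hypothetical grounding prompt: the model is told to answer only from the
# supplied context and to decline when the context does not cover the question.
GROUNDED_PROMPT = """You are a helpful assistant. Answer the user's question
using ONLY the context below. If the context does not contain the answer,
say "I don't know based on the provided documents." Do not add outside facts.

Context:
{context}

Question:
{question}
"""

def build_prompt(context: str, question: str) -> str:
    return GROUNDED_PROMPT.format(context=context, question=question)

print(build_prompt("The Luna model is a transformer-based encoder.",
                   "What kind of model is Luna?"))
```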
