Instrumentation Overview

TruLens is a framework that helps you instrument and evaluate LLM apps including RAGs and agents.

Because TruLens is tech-agnostic, we offer a few different tools for instrumentation:

* TruCustomApp gives you the most power to instrument a custom LLM app, and provides the instrument method.
* TruBasicApp is a simple interface to capture the input and output of a basic LLM app.
* TruChain instruments LangChain apps.
* TruLlama instruments LlamaIndex apps.
* TruRails instruments NVIDIA NeMo Guardrails apps.

In any framework you can track (and evaluate) the inputs, outputs and instrumented internals, along with a wide variety of usage metrics and metadata, detailed below:

Usage Metrics

Number of requests (n_requests)
Number of successful requests (n_successful_requests)
Number of class scores retrieved (n_classes)
Total tokens processed (n_tokens)
In streaming mode, number of chunks produced (n_stream_chunks)
Number of prompt tokens supplied (n_prompt_tokens)
Number of completion tokens generated (n_completion_tokens)
Cost in USD (cost)

Read more about usage tracking in the Cost API Reference.
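To make the field names above concrete, here is a minimal sketch of how such counters could be accumulated across model calls. The `UsageTotals` dataclass and its `add` method are hypothetical illustrations and not the TruLens cost-tracking class; only the field names are taken from the list above, and treating `n_tokens` as prompt plus completion tokens is an assumption of this sketch.

```python
from dataclasses import dataclass


@dataclass
class UsageTotals:
    """Hypothetical container mirroring some of the usage metric names above."""

    n_requests: int = 0
    n_successful_requests: int = 0
    n_tokens: int = 0
    n_prompt_tokens: int = 0
    n_completion_tokens: int = 0
    cost: float = 0.0  # USD

    def add(self, prompt_tokens: int, completion_tokens: int,
            cost: float, ok: bool = True) -> None:
        """Fold the usage of one model call into the running totals."""
        self.n_requests += 1
        if ok:
            self.n_successful_requests += 1
        self.n_prompt_tokens += prompt_tokens
        self.n_completion_tokens += completion_tokens
        # Assumption: total tokens = prompt tokens + completion tokens.
        self.n_tokens += prompt_tokens + completion_tokens
        self.cost += cost


totals = UsageTotals()
totals.add(prompt_tokens=12, completion_tokens=30, cost=0.002)
totals.add(prompt_tokens=8, completion_tokens=0, cost=0.0, ok=False)  # a failed call
```

After these two calls, `totals` counts two requests, one of them successful.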

App Metadata

App ID (app_id) - user-supplied string or automatically generated hash
Tags (tags) - user-supplied string
Model metadata - user-supplied JSON

Record Metadata

Record ID (record_id) - automatically generated; identifies an individual application call
Timestamp (ts) - automatically tracked; the timestamp of the application call
Latency (latency) - the difference between the application call's start and end times
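As a rough illustration of how these three fields relate to a single call, the sketch below wraps an arbitrary function and derives an ID, a timestamp, and a latency for it. The helper `call_with_metadata` and the `record_` ID prefix are hypothetical; TruLens generates and stores these values itself, and its exact formats may differ.

```python
import time
import uuid
from datetime import datetime, timezone


def call_with_metadata(fn, *args, **kwargs):
    """Hypothetical helper: call fn and return (result, metadata)."""
    record_id = f"record_{uuid.uuid4().hex}"  # assumed ID format, illustration only
    ts = datetime.now(timezone.utc)           # when the call started
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    latency = time.perf_counter() - start     # end time minus start time, in seconds
    return result, {"record_id": record_id, "ts": ts, "latency": latency}


result, meta = call_with_metadata(str.upper, "hello")
```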

Using @instrument

from trulens.apps.custom import instrument

class RAG_from_scratch:
    @instrument
    def retrieve(self, query: str) -> list:
        """
        Retrieve relevant text from vector store.
        """

    @instrument
    def generate_completion(self, query: str, context_str: list) -> str:
        """
        Generate answer from context.
        """

    @instrument
    def query(self, query: str) -> str:
        """
        Retrieve relevant text given a query, and then generate an answer from the context.
        """
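Conceptually, a decorator of this kind wraps each method so that its arguments and return value are captured on every call. The sketch below is a simplified plain-Python stand-in and not the TruLens implementation: `record_calls`, `CALLS`, and `TinyRAG` are hypothetical names, and the method bodies are placeholders.

```python
import functools

CALLS = []  # hypothetical in-memory log; TruLens stores records differently


def record_calls(func):
    """Hypothetical stand-in for an instrumenting decorator."""

    @functools.wraps(func)
    def wrapper(self, *args, **kwargs):
        result = func(self, *args, **kwargs)
        # Log the method name, its arguments, and what it returned.
        CALLS.append({"method": func.__name__, "args": args,
                      "kwargs": kwargs, "returned": result})
        return result

    return wrapper


class TinyRAG:
    @record_calls
    def retrieve(self, query: str) -> list:
        return ["TruLens instruments LLM apps."]  # placeholder retrieval

    @record_calls
    def query(self, query: str) -> str:
        context = self.retrieve(query)
        return f"Answer based on {len(context)} chunk(s)."


answer = TinyRAG().query("What does TruLens do?")
```

Because the inner call finishes first, `CALLS` lists `retrieve` before `query`, which is how nested internals of a single application call can be observed.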

If you cannot edit a class to add the decorators needed for tracking, you can instead use one of the static methods of instrument. For example, instrument.method makes sure a custom retriever's method gets instrumented without modifying the class source. See a usage example below:

Using instrument.method

from trulens.apps.custom import instrument
from somepackage.custom_retriever import CustomRetriever

instrument.method(CustomRetriever, "retrieve_chunks")

# ... rest of the custom class follows ...

Read more about instrumenting custom class applications in the API Reference.
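The idea behind instrumenting a method from outside its class can be sketched in plain Python as replacing the attribute on the class with a logging wrapper. This is only an illustration of the general technique, not how TruLens implements instrument.method: `instrument_method` and `CALLS` are hypothetical, and the `CustomRetriever` defined here is a placeholder for a class you would normally import.

```python
import functools

CALLS = []  # hypothetical in-memory log


def instrument_method(cls, name: str) -> None:
    """Hypothetical helper: wrap cls.<name> in place so its calls are logged."""
    original = getattr(cls, name)

    @functools.wraps(original)
    def wrapper(self, *args, **kwargs):
        result = original(self, *args, **kwargs)
        CALLS.append((name, args, result))
        return result

    # Patch the class itself, so every instance picks up the wrapper.
    setattr(cls, name, wrapper)


class CustomRetriever:  # stands in for a class from a third-party package
    def retrieve_chunks(self, query: str) -> list:
        return [f"chunk about {query}"]


instrument_method(CustomRetriever, "retrieve_chunks")
chunks = CustomRetriever().retrieve_chunks("HR")
```

The class definition is never edited; only the attribute lookup on `CustomRetriever` changes, which is why this approach suits classes you do not own.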

Tracking input-output applications

For basic tracking of inputs and outputs, TruBasicApp can be used for instrumentation.

Any text-to-text application can be wrapped with TruBasicApp and then recorded by using the resulting recorder as a context manager.

Using TruBasicApp to log text to text apps

from trulens.apps.basic import TruBasicApp

def custom_application(prompt: str) -> str:
    return "a response"

basic_app_recorder = TruBasicApp(
    custom_application, app_id="Custom Application v1"
)

with basic_app_recorder as recording:
    basic_app_recorder.app("What is the phone number for HR?")
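The recording pattern above can be pictured with a small plain-Python context manager that keeps input and output pairs only while the `with` block is active. `BasicRecorder` is a hypothetical stand-in written for this sketch and is not the TruBasicApp implementation; in particular, discarding calls made outside the block is an assumption of the sketch.

```python
class BasicRecorder:
    """Hypothetical recorder: wraps a text-to-text function and logs its calls."""

    def __init__(self, fn, app_id: str):
        self.fn = fn
        self.app_id = app_id
        self.records = []
        self._active = False

    def __enter__(self):
        self._active = True
        return self.records  # the `as` target sees records as they accumulate

    def __exit__(self, exc_type, exc, tb):
        self._active = False
        return False  # do not suppress exceptions raised inside the block

    def app(self, prompt: str) -> str:
        response = self.fn(prompt)
        if self._active:  # assumption: only calls inside the `with` block are kept
            self.records.append({"input": prompt, "output": response})
        return response


recorder = BasicRecorder(lambda prompt: "a response", app_id="Custom Application v1")
recorder.app("This call happens outside the block.")
with recorder as recording:
    recorder.app("What is the phone number for HR?")
```

Here `recording` ends up holding a single input and output pair, for the call made inside the block.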

For frameworks with deep integrations, TruLens can expose additional internals of the application for tracking. See TruChain and TruLlama for more details.