TruLens: Don't just vibe check your LLM app!
Create credible and powerful LLM apps, faster. TruLens is a software tool that helps you to
objectively measure the quality and effectiveness of your LLM-based applications using feedback
functions. Feedback functions help to programmatically evaluate the quality of inputs, outputs,
and intermediate results, so that you can expedite and scale up experiment evaluation. Use it
for a wide variety of use cases including question answering, summarization, retrieval-augmented generation,
and agent-based applications.
Evaluate
Evaluate how your choices are performing across multiple feedback functions, such as:
- Groundedness
- Context Relevance
- Safety
Iterate
Leverage and add to an extensible library of built-in feedback functions. Observe where apps
have weaknesses to inform iteration on prompts, hyperparameters, and more.
Test
Compare different LLM apps on a metrics leaderboard to pick the best performing one.