seeds
Evals for a RAG bot: a hands-on loop for conversation designers
A field-tested workflow for catching hallucinations, citation drift, and unsafe outputs on a Langfuse-traced bot, written for designers who have never run an eval before.
1 entry