Langfuse Flashcards

Open-source LLM observability: traces, evals, prompts, and cost tracking

🔎 What is Langfuse?

Langfuse is an open-source LLM observability & analytics platform for logging, debugging, evaluating, and optimizing AI application behavior in production.

🧱 Core Objects

Traces (end-to-end runs), Spans (steps), Generations (model calls), and Events (custom logs) capture full context.
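The nesting of these objects can be sketched with plain dataclasses. This is a simplified conceptual model, not the SDK's actual classes; all names here are illustrative.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical, simplified model of the Langfuse object hierarchy --
# not the SDK's real classes, just the conceptual nesting.

@dataclass
class Generation:          # a single model call
    model: str
    prompt: str
    completion: str

@dataclass
class Span:                # one step inside a trace
    name: str
    generations: List[Generation] = field(default_factory=list)

@dataclass
class Trace:               # one end-to-end run
    name: str
    user_id: Optional[str] = None
    spans: List[Span] = field(default_factory=list)

trace = Trace(name="answer-question", user_id="user-123")
retrieval = Span(name="retrieve-docs")
llm_step = Span(name="generate-answer")
llm_step.generations.append(
    Generation(model="example-model", prompt="Summarize...", completion="...")
)
trace.spans.extend([retrieval, llm_step])
print(len(trace.spans))  # 2
```

The key idea is containment: a trace owns its spans, and a span owns the model calls made within it, so the full context of any generation is recoverable.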

🔌 SDKs & Integrations

JavaScript/TypeScript & Python SDKs; works alongside LangChain, LlamaIndex, OpenAI/Anthropic clients, and custom stacks.

๐Ÿ“ Prompt Versioning

Store prompts with variables, version them, and compare iterations to see which variants perform best.
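A minimal sketch of the versioning idea, assuming `{{variable}}`-style placeholders: each save of a named prompt bumps its version, and any version can be rendered with variables. The `PromptStore` class and its methods are hypothetical, not the hosted Langfuse prompt API.

```python
import re

# Illustrative in-memory prompt store with versioning -- not Langfuse's API.
class PromptStore:
    def __init__(self):
        self._versions = {}  # name -> list of templates (index = version - 1)

    def create(self, name: str, template: str) -> int:
        """Save a new version of a named prompt; returns the version number."""
        self._versions.setdefault(name, []).append(template)
        return len(self._versions[name])

    def render(self, name: str, version: int, **variables) -> str:
        """Fill {{placeholders}} in the given version with variables."""
        template = self._versions[name][version - 1]
        return re.sub(r"\{\{(\w+)\}\}",
                      lambda m: str(variables[m.group(1)]), template)

store = PromptStore()
store.create("greet", "Hello {{user}}!")
v2 = store.create("greet", "Hi {{user}}, welcome back!")
print(store.render("greet", v2, user="Ada"))  # Hi Ada, welcome back!
```

Keeping old versions addressable is what makes side-by-side comparison of variants possible.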

📊 Evaluations

Collect human feedback (thumbs, numeric, categorical) and run automated evals to score quality, relevance, safety, or accuracy.
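Mixed feedback types only become comparable once they share a scale. A hedged sketch of that normalization step, with made-up labels and trace IDs:

```python
from statistics import mean

# Illustrative: map thumbs / categorical / numeric feedback onto 0..1
# so scores can be averaged per trace. Labels here are assumptions.
def normalize(score) -> float:
    if score in ("up", "good"):
        return 1.0
    if score in ("down", "bad"):
        return 0.0
    return float(score)  # already numeric, e.g. 0.4

feedback = {
    "trace-1": ["up", 0.9, "good"],
    "trace-2": ["down", 0.4],
}
scores = {t: mean(normalize(s) for s in vals) for t, vals in feedback.items()}
print(scores["trace-2"])  # 0.2
```

Automated evals slot into the same shape: an LLM-as-judge or heuristic scorer just emits another numeric score per trace.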

💰 Cost & Tokens

Track latency, token counts, and cost per request, user, route, or model to keep budgets under control.
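Per-request cost is just token counts times per-model rates. A minimal sketch; the model names and prices below are placeholder assumptions, not real provider rates.

```python
# (input, output) USD per 1K tokens -- made-up placeholder prices.
PRICES_PER_1K = {
    "small-model": (0.0005, 0.0015),
    "large-model": (0.0100, 0.0300),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request from its token counts and the model's rates."""
    in_price, out_price = PRICES_PER_1K[model]
    return input_tokens / 1000 * in_price + output_tokens / 1000 * out_price

cost = request_cost("large-model", input_tokens=2000, output_tokens=500)
print(round(cost, 4))  # 0.035
```

Summing this per user, route, or model is what turns raw logs into a budget view.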

🧪 Datasets & Experiments

Create datasets of inputs/outputs, run batch tests against models/prompts, and compare results over time.
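A batch experiment boils down to: run every dataset item through each variant, score the outputs, and compare. A sketch with a stubbed "model" standing in for a real LLM call; the dataset, templates, and exact-match metric are all illustrative.

```python
# Tiny dataset of input/expected pairs -- illustrative only.
dataset = [
    {"input": "2+2", "expected": "4"},
    {"input": "3+3", "expected": "6"},
]

def fake_model(prompt: str) -> str:
    # Stand-in for a real LLM call: evaluate the arithmetic after the colon.
    expr = prompt.split(":")[-1].strip()
    return str(eval(expr))

variants = {
    "v1": "Answer briefly: {q}",
    "v2": "Compute: {q}",
}

results = {}
for name, template in variants.items():
    correct = sum(
        fake_model(template.format(q=item["input"])) == item["expected"]
        for item in dataset
    )
    results[name] = correct / len(dataset)  # exact-match accuracy per variant
print(results)
```

Storing each run's per-item results, not just the aggregate, is what lets you compare variants and track regressions over time.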

๐Ÿ—‚๏ธ Trace Explorer

Filter and drill into traces by user, tag, route, time, or error; inspect prompts, responses, and intermediate steps.

๐Ÿ›ก๏ธ Privacy & Control

Supports redaction of sensitive fields, configurable retention, and self-hosted or managed deployment options.

๐ŸŽ›๏ธ Sampling & Rate Limits

Sample high-volume traffic to reduce overhead; tag important runs for always-on logging.
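One common way to implement this is deterministic sampling: hash the trace ID into [0, 1) for a stable keep/drop decision, with a tag override for always-on logging. The function, tag name, and default rate below are assumptions for illustration.

```python
import hashlib

def should_log(trace_id: str, tags=(), sample_rate: float = 0.1) -> bool:
    """Stable sampling decision for a trace; flagged runs are always kept."""
    if "important" in tags:
        return True  # always-on logging for tagged runs
    # Hash the ID to a uniform value in [0, 1); same ID -> same decision.
    digest = hashlib.sha256(trace_id.encode()).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return bucket < sample_rate

print(should_log("trace-abc", tags=["important"]))  # True
```

Hashing rather than random sampling means retries and distributed services agree on whether a given trace is logged.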

📤 Export & Webhooks

Export logs via API/CSV and trigger webhooks to pipe signals into data warehouses, alerting, or BI tools.

🚀 Common Use Cases

Production monitoring, A/B testing of prompts, RAG quality tracking, step-by-step agent debugging, and SLA reporting.