Langfuse joins ClickHouse! →Langfuse joins ClickHouse! Learn more →
Langfuse LogoLangfuse Logo
Hiring in Berlin and SFLooking for GOATS!
DocsSelf HostingGuidesIntegrationsFAQHandbookChangelogPricingLibrarySecurity & Compliance
GitHub
22K
Get DemoAppSign Up
  • DocsIntegrationsSelf Hosting
    Guides
    AI Engineering Library
  • Overview
    • Overview
    • Evaluation of Rag with Ragas
    • Evaluation with Langchain
    • Evaluation with Uptrain
    • Migrating Data from One Langfuse Project to Another
    • Example Decorator Openai Langchain
    • Evaluating Multi-Turn Conversations
    • Example - Tracing and Evaluation for the OpenAI-Agents SDK
    • External Evaluation Pipelines
    • Guide - Building an intent classification pipeline
    • Example - Trace and Evaluate LangGraph Agents
    • Example Llm Security Monitoring
    • Example Multi Modal Traces
    • Agent Evaluation Guide
    • Query Data in Langfuse via the SDK
    • Evaluating Multi-Turn Conversations (Simulation)
    • Synthetic Datasets
    • Amazon Bedrock
    • Anthropic (Python)
    • Integration Azure Openai Langchain
    • Databricks
    • Integration Langchain
    • Open Source Observability for LangGraph
    • Langserve
    • Cookbook - LiteLLM (Proxy) + Langfuse OpenAI Integration + Python Decorator
    • Integration Llama Index Callback
    • Integration Llama Index Instrumentation
    • Integration Llama Index Milvus Lite
    • LlamaIndex
    • Monitoring LlamaIndex applications with PostHog and Langfuse
    • LlamaIndex Workflows
    • OpenAI Assistants API
    • Cookbook - OpenAI Integration (Python)
    • Observe OpenAI Structured Outputs with Langfuse
    • Anthropic (JS/TS)
    • Langchain Integration (JS/TS)
    • LiteLLM (Proxy) + Langfuse OpenAI Integration (JS/TS)
    • OpenAI Integration (JS/TS)
    • JS/TS SDK Example
    • Prompt Management with Langchain (JS)
    • Langfuse SDK Performance Test
    • Tracing using the OpenInference SDK
    • MLflow Integration via OpenTelemetry
    • OpenLIT Integration via OpenTelemetry
    • Otel Integration Openllmetry
    • Example - Langfuse Prompt Management with Langchain (Python)
    • Prompt Management Openai Functions
    • Prompt Management Performance Benchmark
    • Overview
    • Beginner's Guide to RAG Evaluation with Langfuse and Ragas
    • External Evaluation Pipelines
    • Introducing Datasets v2
    • Introducing Langfuse 2.0
    • Introducing the observe() decorator for Python
    • LLM-as-a-Judge Evaluators for Dataset Experiments
    • LLM Playground
    • Posthog Integration
    • Run Langfuse Locally in 3 Minutes
    • Webinar: Traceability and Observability in Multi-Step LLM Systems

On This Page

  • Learn more
Question? Give us feedback →Edit this page on GitHub
Contributors
Marc Klingen
Marc Klingen
Co-founder
GuidesVideosExternal Evaluation Pipelines

External Evaluation Pipelines

Learn more

External Evaluation Pipeline Example
Beginner's Guide to RAG Evaluation with Langfuse and RagasIntroducing Datasets v2
Was this page helpful?
Support

GitHubDiscordXYouTubeLinkedIn

Product

  • Observability
  • Prompt Management
  • Evaluation
  • Metrics
  • Playground
  • Pricing
  • Enterprise

Developers

  • Documentation
  • Python SDK
  • JS/TS SDK
  • Integrations
  • API Reference
  • Self-Hosting
  • Guides & Cookbooks

Resources

  • Blog
  • Changelog
  • Roadmap
  • Interactive Demo
  • Customers
  • AI Engineering Library
  • Status

Company

  • About Us
  • Careers
  • Press
  • Security
  • Support
  • Talk to Us
  • Open Source
TermsPrivacyImprintCookie PolicySOC 2 Type IIISO 27001GDPRHIPAA🇯🇵 Japanese🇰🇷 Korean🇨🇳 Chinese
© 2022-2026 Langfuse GmbH / Finto Technologies Inc.