How Cloudflare Workers Minimize Latency

Benchmarking Helicone’s Proxy Service

Results

API Reference

Guides

Helicone OSS LLM Observability

Dashboard

Discord

Github

Helicone minimizes latency for your LLM applications using Cloudflare's global network. Detailed benchmarking results and performance metrics included.

Latency

Latency Impact

Latency Impact - Helicone OSS LLM Observability

Get started with Helicone, the open-source LLM observability platform for developers to monitor, debug, and optimize their applications.

Introduction

Introduction to Helicone - Open Source LLM Observability

Comprehensive guide to all Helicone headers. Learn how to access and implement various Helicone features through custom request headers.

Header Directory

Helicone Header Directory

Helicone Header Directory - Open Source LLM Observability

Complete guide to Helicone authentication. Learn about API key types, permissions, regional variations, and alternative authentication methods.

Helicone Auth

Helicone Authentication

Helicone Authentication - Open Source LLM Observability

Comprehensive guides to help you deploy and manage your own instance of Helicone.

Overview

Self-Hosting Helicone

Set up Helicone on cloud infrastructure. Step-by-step instructions for deploying the LLM observability platform on a single node or Kubernetes cluster in the cloud.

Cloud

Cloud Deployment

Cloud Deployment of Helicone - Open Source LLM Observability

Deploy Helicone using Docker Compose. Quick setup guide for running a containerized instance of the LLM observability platform on your local machine or server.

Docker Compose

Docker Compose Self-Hosting

Docker Compose Deployment - Helicone OSS LLM Observability

Deploy Helicone using Kubernetes and Helm. Quick setup guide for running a containerized instance of the LLM observability platform on your Kubernetes cluster.

Kubernetes

Kubernetes Self-Hosting

Kubernetes Deployment - Helicone OSS LLM Observability

Deploy your own instance of Helicone. Follow our step-by-step guide to set up a local, self-hosted version of the LLM observability platform.

Manual

Manual Self-Hosting

Self-Hosted Helicone Deployment - Open Source LLM Observability

The optimal method to integrate with Helicone is through our Gateway. This API provides a seamless interface for sending requests and responses to Helicone.

Gateway Integration

Integrate any custom LLM, including open-source models like Llama and GPT-Neo, with Helicone. Step-by-step guide for both NodeJS and Curl implementations to connect your proprietary or open-source models.

Custom Model

Custom Model Integration

Custom Model Integration - Helicone OSS LLM Observability

Log LLM traces directly to Helicone, bypassing our proxy, with OpenLLMetry. Supports OpenAI, Anthropic, Azure OpenAI, Cohere, Bedrock, Google AI Platform, and more.

OpenLLMetry

OpenLLMetry Async Integration

OpenLLMetry Async Integration - Helicone OSS LLM Observability

Use OpenAI's JavaScript SDK to integrate with Helicone to log your OpenAI usage.

JavaScript

OpenAI JavaScript SDK Integration

OpenAI JavaScript SDK Integration - Helicone OSS LLM Observability

Use OpenAI's Python SDK to integrate with Helicone to log your OpenAI usage.

Python

OpenAI Python SDK Integration

OpenAI Python SDK Integration - Helicone OSS LLM Observability

Use LangChain to integrate OpenAI with Helicone to log your OpenAI usage.

LangChain

OpenAI LangChain Integration

OpenAI Langchain Integration - Helicone OSS LLM Observability

Use LlamaIndex to integrate with Helicone to log your LlamaIndex usage.

LlamaIndex

LlamaIndex Integration

LlamaIndex Integration - Helicone OSS LLM Observability

Integrate Helicone with Crew AI, a multi-agent framework for automating complex workflows. Monitor AI-driven tasks and agent interactions for real use cases.

Crew AI

Crew AI Integration

Crew AI Integration - Helicone OSS LLM Observability

Use cURL to integrate OpenAI with Helicone to log your OpenAI usage.

cURL

OpenAI cURL Integration

OpenAI cURL Integration - Helicone OSS LLM Observability

Integrate OpenAI Assistants with Helicone to monitor and analyze your assistant usage.

Assistants

OpenAI Assistants Integration

OpenAI Assistants Integration - Helicone OSS LLM Observability

Use JavaScript to integrate Azure OpenAI with Helicone to log your Azure OpenAI usage.

Azure OpenAI JavaScript Integration

Azure OpenAI JavaScript Integration - Helicone OSS LLM Observability

Use Python to integrate Azure OpenAI with Helicone to log your Azure OpenAI usage.

Azure OpenAI Python Integration

Azure OpenAI Python Integration - Helicone OSS LLM Observability

Use LangChain to integrate Azure OpenAI with Helicone to log your Azure OpenAI usage.

Azure OpenAI LangChain Integration

Azure OpenAI Langchain Integration - Helicone OSS LLM Observability

Use cURL to integrate Azure OpenAI with Helicone to log your Azure OpenAI usage.

Azure OpenAI cURL Integration

Azure-OpenAI cURL Integration - Helicone OSS LLM Observability

Use Anthropic's JavaScript SDK to integrate with Helicone to log your Anthropic LLM usage.

Anthropic JavaScript SDK Integration

Anthropic JavaScript SDK Integration - Helicone OSS LLM Observability

Use Anthropic's Python SDK to integrate with Helicone to log your Anthropic LLM usage.

Anthropic Python SDK Integration

Anthropic Python SDK Integration - Helicone OSS LLM Observability

Use LangChain to integrate Anthropic with Helicone to log your Anthropic LLM usage.

Anthropic LangChain Integration

Anthropic LangChain Integration - Helicone OSS LLM Observability

Use cURL to integrate Anthropic with Helicone to log your Anthropic LLM usage.

cURL Integration

Anthropic cURL Integration

Anthropic cURL Integration - Helicone OSS LLM Observability

Use Helicone's JavaScript SDK to log your Ollama usage.

Ollama Javascript Integration

Learn how to integrate AWS Bedrock with Helicone using JavaScript.

AWS Bedrock JavaScript SDK Integration

Use Vercel AI SDK to integrate with Helicone to log your OpenAI usage.

Vercel AI

Vercel AI SDK Integration

Vercel AI SDK Integration - Helicone OSS LLM Observability

Connect Helicone with any LLM deployed on Anyscale, including Llama, Mistral, Gemma, and GPT.

Anyscale

Anyscale Integration

Anyscale Integration - Helicone OSS LLM Observability

Connect Helicone with Together AI, a platform for running open-source language models. Monitor and optimize your AI applications using Together AI's powerful models through a simple base_url configuration.

Together AI

Together AI Integration

Together AI Integration - Helicone OSS LLM Observability

Integrate Helicone with Hyperbolic, a platform for running open-source LLMs. Monitor and analyze interactions with any Hyperbolic-deployed model using a simple base_url configuration.

Hyperbolic

Hyperbolic Integration

Hyperbolic Integration - Helicone OSS LLM Observability

Use Groq's JavaScript SDK to integrate with Helicone to log your Groq usage.

Groq JavaScript SDK Integration

Groq JavaScript SDK Integration - Helicone OSS LLM Observability

Use Groq's Python SDK to integrate with Helicone to log your Groq usage.

Groq Python SDK Integration

Groq Python SDK Integration - Helicone OSS LLM Observability

Use cURL to integrate Groq with Helicone to log your Groq usage.

Groq cURL Integration

Groq cURL Integration - Helicone OSS LLM Observability

Use Instructor's JavaScript SDK to log your LLM calls in Helicone.

Instructor JavaScript SDK Integration

Instructor JavaScript SDK Integration - Helicone OSS LLM Observability

Use Instructor's Python SDK to log your LLM calls in Helicone.

Instructor Python SDK Integration

Instructor Python SDK Integration - Helicone OSS LLM Observability

Connect Helicone with OpenAI-compatible models on Deepinfra. Simple setup process using a custom base_url for seamless integration with your Deepinfra-based AI applications.

Deepinfra

Deepinfra Integration

Deepinfra Integration - Helicone OSS LLM Observability

Integrate Helicone with OpenRouter, a unified API for accessing multiple LLM providers. Monitor and analyze AI interactions across various models through a single, streamlined interface.

OpenRouter

OpenRouter Integration

OpenRouter Integration - Helicone OSS LLM Observability

Easily log your LiteLLM API calls to Helicone using OpenLLmetry.

OpenLLmetry

LiteLLM Integration with OpenLLmetry

LiteLLM Integration with OpenLLmetry - Helicone OSS LLM Observability

Connect Helicone with LiteLLM, a unified interface for multiple LLM providers. Standardize logging and monitoring across various AI models with simple callback or proxy setup.

Callbacks

LiteLLM Integration

LiteLLM Integration - Helicone OSS LLM Observability

Connect Helicone with Fireworks AI, a platform that provides a fast, affordable, and customizable solution for generative artificial intelligence that helps product developers run, fine-tune, and share large language models.

Fireworks AI

Fireworks AI Integration

Fireworks AI Integration - Helicone OSS LLM Observability

Log any Vector DB interactions to Helicone using Helicone's Javascript SDK.

Javascript

Vector DB Javascript SDK Integration

Log any Vector DB interactions to Helicone using cURL.

Vector DB Curl Integration

Log any external tools used in your LLM applications to Helicone using Helicone's Javascript SDK.

Tools Javascript SDK Integration

Log any external tools used in your LLM applications to Helicone using cURL.

Tools Curl Integration

Combine Helicone's LLM analytics with PostHog, a comprehensive product analytics platform. View LLM insights alongside other application data for holistic performance analysis.

PostHog

PostHog Integration

PostHog Integration - Helicone OSS LLM Observability

Integrate Helicone with Ragas, an open-source framework for evaluating Retrieval-Augmented Generation (RAG) systems. Monitor and analyze the performance of your RAG pipelines.

Ragas

Ragas Integration

Ragas Integration - Helicone LLM Observability

Integrate Helicone with Open WebUI, an extensible, offline-capable interface for various LLM runners. Monitor interactions across Ollama, OpenAI-compatible APIs, and custom LLM setups.

Open WebUI

Open WebUI Integration

Open WebUI Integration - Helicone OSS LLM Observability

Integrate Helicone with MetaGPT, a multi-agent framework that simulates a software company workflow. Monitor AI-driven software development processes and agent interactions.

MetaGPT

MetaGPT Integration

MetaGPT Integration - Helicone OSS LLM Observability

Integrate Helicone with Open Devin, an AI-powered platform for autonomous software engineering. Monitor interactions between AI agents and human developers in your projects.

Open Devin

Open Devin Integration

Open Devin Integration - Helicone OSS LLM Observability

Integrate Helicone with Embedchain, an Open Source Framework for personalizing LLM responses. Monitor interactions across different LLMs in AI applications.

Mem0 Embedchain

Mem0 Embedchain Integration

Mem0 Embedchain Integration - Helicone OSS LLM Observability

Dify is an open-source LLM app development platform. Its intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production. Here is how to get Observability and logs for your dify instance.

Dify LLM Observability and Logs

Dify

Leverage Upstash QStash with Helicone to make asynchronous LLM calls, perfect for serverless environments. Enhance your LLM operations with extended timeouts, retries, and batching while maintaining full observability.

QStash

Upstash QStash LLM Integration

Integrate Helicone with Upstash RAG Chat to enhance your retrieval-augmented generation applications with comprehensive observability and analytics.

Statistic	OpenAI (s)	Helicone (s)
Mean	2.21	2.21
Median	2.87	2.90
Standard Deviation	1.12	1.12
Min	0.14	0.14
Max	3.56	3.76
p10	0.52	0.52
p90	3.27	3.29

Getting Started

Dev Integrations

Other Integrations

Features

References

Latency Impact

How Cloudflare Workers Minimize Latency

Benchmarking Helicone’s Proxy Service

Results

FAQ

Getting Started

Dev Integrations

Other Integrations

Features

References

​How Cloudflare Workers Minimize Latency

​Benchmarking Helicone’s Proxy Service

​Results

​FAQ

How Cloudflare Workers Minimize Latency

Benchmarking Helicone’s Proxy Service

Results

FAQ