Alert Types
Error Rate Alerts
Track the percentage of failed requests over a time window. Essential for maintaining application reliability. Use cases:- Detect provider outages or rate limiting issues
- Catch breaking changes in prompts or model behavior
- Monitor deployment health after updates
- Identify patterns in user inputs causing failures
Cost Alerts
Monitor spending to prevent budget overruns and detect unusual usage patterns. Use cases:- Prevent unexpected bills from runaway processes
- Track per-environment spending (dev/staging/prod)
- Detect potential abuse or misconfiguration
- Monitor cost trends for specific features or users
Creating Alerts
Navigate to Settings → Alerts in your Helicone dashboard to create new alerts.
Helicone Alerts Dashboard showing configured alerts and their status
1
Select alert type and threshold

Creating a new alert in Helicone
- Percentage threshold: 1-100% (5-10% recommended for production)
- Tracks ratio of failed requests to total requests
- Failed requests include 4xx/5xx errors and timeouts
- Dollar amount threshold (e.g., 500, $1000)
- Tracks cumulative spend within time window
- Includes all model costs across providers
2
Configure time window
Choose how long to evaluate the metric:
- 5-15 minutes: Immediate detection, higher false positive rate
- 30-60 minutes: Balanced approach (recommended for most apps)
- 2-4 hours: Sustained issues only, fewer false positives
- Daily/Weekly: Budget tracking and long-term trends
Shorter windows detect issues faster but may trigger during brief spikes. Longer windows reduce noise but delay detection.
3
Set minimum request threshold
Prevent false positives during low traffic periods:
- Development: 5-10 requests minimum
- Staging: 10-20 requests minimum
- Production: 20-50 requests minimum
Always set a minimum request count to avoid alert fatigue. A single failed request during low traffic shouldn’t trigger a 100% error rate alert.
4
Configure notifications
Choose where alerts are sent:
- Email: Add any email address (immediate delivery)
- Slack: Select connected channels (#alerts, #engineering, etc.)
- Multiple recipients: Add several emails or channels per alert
Start with conservative thresholds (higher error %, longer windows) and tighten based on actual patterns. This prevents alert fatigue while you learn your app’s normal behavior.

Example of a configured cost alert
Notification Channels
Dashboard
All alerts appear in your Helicone dashboard with real-time status updates. When an alert triggers, you can immediately see affected requests and investigate the issue.
Alert triggered view in the dashboard
Email Notifications
Add any email address to receive alerts. Emails include:- Alert type and threshold that triggered
- Current metric value and trend
- Direct link to affected requests in dashboard
- Time window and request count

Example alert notification email
Slack Integration
When creating or editing an alert:- Select Slack as the notification method
- Click Connect Slack button that appears
- Authorize Helicone in your Slack workspace
- Select a channel from the dropdown (#alerts, #engineering, etc.)
Configuration Examples
Production Monitoring
Development Environment
Advanced Features (Coming Soon)
Soon you’ll be able to create massively customizable alerts:- Custom aggregations - Alert on any metric (P95 latency, token usage, specific error codes)
- Advanced filters - Combine multiple custom properties with AND/OR logic
- Complex thresholds - Percentage changes, rolling averages, anomaly detection
- Custom webhooks - Send alerts to any endpoint