Inference Activity logs key details about requests and responses between your application and the Heroku AI Inference API. Monitor token usage, model performance, and API calls in real time with detailed logs, graphs, and metrics such as speed, latency, throughput, and resource utilization.
Continuously monitoring tokens in and tokens out is critical for identifying inefficiencies, minimizing waste, and keeping expenses in check. Optimize prompts, choose cost-effective models, and manage API usage to prevent unexpected costs and maximize efficiency.
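As a sketch of how token counts translate into spend, the snippet below estimates per-request cost from tokens in and tokens out. The prices are placeholders, not Heroku's actual rates, and `estimate_cost` is a hypothetical helper, not part of the Inference Activity API:

```python
# Hypothetical per-1K-token prices; real rates depend on the model you provision.
PRICE_PER_1K_INPUT = 0.003
PRICE_PER_1K_OUTPUT = 0.015

def estimate_cost(tokens_in: int, tokens_out: int) -> float:
    """Estimate request cost (USD) from input and output token counts."""
    return (tokens_in / 1000) * PRICE_PER_1K_INPUT \
         + (tokens_out / 1000) * PRICE_PER_1K_OUTPUT
```

Summing this estimate across requests gives a running spend figure you can compare against the usage graphs the add-on provides.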
Set threshold-based alerts on API usage, total cost, and response times to catch issues before they escalate. Get instant notifications via email, Slack, or webhooks to prevent overruns and maintain control.
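The threshold check behind such alerts can be sketched as follows; the metric names and limits are illustrative assumptions, not Inference Activity's actual schema:

```python
# Illustrative alert thresholds (hypothetical metric names and limits).
THRESHOLDS = {
    "total_cost_usd": 50.0,     # monthly spend ceiling
    "p95_latency_ms": 2000,     # response-time budget
    "requests_per_min": 600,    # API usage rate cap
}

def breached(metrics: dict) -> list[str]:
    """Return the names of all metrics that exceed their threshold."""
    return [name for name, limit in THRESHOLDS.items()
            if metrics.get(name, 0) > limit]
```

Each breach would then trigger a notification, for example an email, a Slack message, or a POST to a configured webhook URL.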
The available application locations for this add-on are shown below and depend on whether the application is deployed to a Common Runtime region or a Private Space.
**Common Runtime**

| Region | Available |
| --- | --- |
| United States | |
| Europe | |
**Private Spaces**

| Region | Available | Installable in Space |
| --- | --- | --- |
| Dublin | | |
| Frankfurt | | |
| London | | |
| Montreal | | |
| Mumbai | | |
| Oregon | | |
| Singapore | | |
| Sydney | | |
| Tokyo | | |
| Virginia | | |
This add-on is in alpha and can only be provisioned if you have been invited by the add-on partner. To provision, copy the snippet into your CLI.
View add-on docs on DevCenter