Inference Activity is an alpha add-on and is invite-only. If you have an invitation, install this add-on via the CLI.

Inference Activity Alpha

See how you are using Heroku Inference’s AI models. Starting at ~$0/hour.

Activity Usage in Real Time

Inference Activity logs key details about the requests and responses exchanged between your application and the Heroku AI Inference API. Monitor token usage, model performance, and API calls in real time with detailed logs, graphs, and metrics such as speed, latency, throughput, and resource utilization.
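As a rough illustration of how metrics like latency and throughput can be derived from per-request logs, here is a minimal Python sketch. The `RequestLog` record and its field names are hypothetical and chosen for this example only; they are not the add-on's actual log format or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RequestLog:
    """Hypothetical per-request record; field names are assumptions."""
    model: str
    latency_s: float   # wall-clock time for the request, in seconds
    tokens_in: int     # tokens sent in the prompt
    tokens_out: int    # tokens returned in the response


def summarize(logs: list[RequestLog]) -> dict[str, float]:
    """Aggregate request count, mean latency, and output-token throughput."""
    if not logs:
        raise ValueError("at least one log record is required")
    total_latency = sum(log.latency_s for log in logs)
    total_out = sum(log.tokens_out for log in logs)
    return {
        "requests": float(len(logs)),
        "mean_latency_s": total_latency / len(logs),
        # Output tokens generated per second of request time.
        "output_tokens_per_s": total_out / total_latency if total_latency else 0.0,
    }


if __name__ == "__main__":
    sample = [
        RequestLog("model-a", latency_s=1.0, tokens_in=200, tokens_out=100),
        RequestLog("model-a", latency_s=3.0, tokens_in=500, tokens_out=300),
    ]
    print(summarize(sample))
```

Throughput here is computed over summed request time, which suits sequential requests; concurrent traffic would need a wall-clock window instead.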

Cost Optimization

Monitoring tokens in and tokens out is critical for identifying inefficiencies, minimizing waste, and keeping expenses in check. Optimize prompts, choose cost-effective models, and manage API usage to prevent unexpected costs and maximize efficiency.
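To make the link between token counts and cost concrete, the sketch below estimates the cost of a request from its tokens in and tokens out, and picks the cheapest model for a given workload. The prices and model names are placeholders invented for this example, not Heroku's actual pricing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrice:
    """Hypothetical USD price per 1,000 tokens (placeholder values only)."""
    input_per_1k: float
    output_per_1k: float


def request_cost(tokens_in: int, tokens_out: int, price: TokenPrice) -> float:
    """Estimated cost of one request: input and output tokens are billed separately."""
    return (tokens_in / 1000) * price.input_per_1k + (tokens_out / 1000) * price.output_per_1k


def cheapest_model(tokens_in: int, tokens_out: int, prices: dict[str, TokenPrice]) -> str:
    """Return the name of the model with the lowest estimated cost for this workload."""
    return min(prices, key=lambda name: request_cost(tokens_in, tokens_out, prices[name]))


if __name__ == "__main__":
    # Placeholder models and prices, for illustration only.
    prices = {
        "large-model": TokenPrice(input_per_1k=3.0, output_per_1k=15.0),
        "small-model": TokenPrice(input_per_1k=0.5, output_per_1k=1.5),
    }
    print(request_cost(2000, 500, prices["small-model"]))
    print(cheapest_model(2000, 500, prices))
```

Because output tokens are often priced higher than input tokens, trimming response length can matter as much as shortening prompts.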

Alerting & Notifications

Set threshold-based alerts on API usage, total cost, and response times to catch issues before they escalate. Get instant notifications via email, Slack, or webhooks to prevent overruns and maintain control.
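A threshold-based alert boils down to comparing current usage against configured limits. The following Python sketch shows that comparison for the three quantities named above; the `UsageSnapshot` and `Thresholds` types and their fields are hypothetical, and delivering the resulting messages by email, Slack, or webhook is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UsageSnapshot:
    """Hypothetical usage totals for one reporting window."""
    api_calls: int
    total_cost_usd: float
    response_time_ms: float


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical alert limits; an alert fires when a value exceeds its limit."""
    max_api_calls: int
    max_total_cost_usd: float
    max_response_time_ms: float


def breached(snapshot: UsageSnapshot, limits: Thresholds) -> list[str]:
    """Return one human-readable message per exceeded threshold."""
    alerts: list[str] = []
    if snapshot.api_calls > limits.max_api_calls:
        alerts.append(f"API calls {snapshot.api_calls} > {limits.max_api_calls}")
    if snapshot.total_cost_usd > limits.max_total_cost_usd:
        alerts.append(f"Total cost ${snapshot.total_cost_usd:.2f} > ${limits.max_total_cost_usd:.2f}")
    if snapshot.response_time_ms > limits.max_response_time_ms:
        alerts.append(f"Response time {snapshot.response_time_ms:.0f} ms > {limits.max_response_time_ms:.0f} ms")
    return alerts


if __name__ == "__main__":
    limits = Thresholds(max_api_calls=10_000, max_total_cost_usd=50.0, max_response_time_ms=2_000.0)
    snapshot = UsageSnapshot(api_calls=12_500, total_cost_usd=42.10, response_time_ms=2_400.0)
    for message in breached(snapshot, limits):
        print(message)
```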

Region Availability

The available application locations for this add-on are shown below, and depend on whether the application is deployed to a Common Runtime region or a Private Space.

Common Runtime

  • United States
  • Europe

Private Spaces

  • Dublin
  • Frankfurt
  • London
  • Montreal
  • Mumbai
  • Oregon
  • Singapore
  • Sydney
  • Tokyo
  • Virginia

Plans & Pricing

    • Data Retention: 30 days
    • Alerts
    • Notifications
heroku addons:create inference-activity

This add-on is in alpha and can only be provisioned if you have been invited by this add-on partner.
To provision, copy the snippet into your CLI.

Inference Activity Documentation