How does Spring Agents work?

Sign up and get a webhook URL, add it to PagerDuty's Generic Webhooks (V3), and when an incident fires, Spring Agents generates 3 ranked root cause hypotheses with confidence scores and posts them as notes on the incident within seconds.

How long does setup take?

Under 5 minutes. Create an account, copy your webhook URL, and paste it into PagerDuty. No code changes to your stack required.

Is there a free tier?

Yes. The free plan includes 25 AI-analyzed incidents per month with no credit card required.

the root cause copilot for pagerduty

Incident analysis with real infrastructure context posted straight into PagerDuty

Spring Agents pulls live metrics from your cloud environment, detects capacity patterns, and delivers ranked root causes with FinOps insights, all as a structured note on the incident your team is already working. No new tools. No data pipelines.

Start free: 25 incidents/month See an example

No credit card required. Set up in under 5 minutes.

After years running production cloud platforms and fighting PagerDuty incidents at 3 AM, we built Spring Agents for ops teams who need AI analysis that actually works.

Monitoring grafana, datadog, etc.

PagerDuty alert fires

Spring Agents enrich + analyze

PagerDuty note posted

Faster fixes,
fewer repeats better outcomes

How it works

Connects via webhook. Enriches with live cloud metrics. Posts structured analysis back to the incident. Under five minutes to set up.

PagerDuty sends the webhook

Cloud context gathered automatically

Resource identifiers are extracted from the alert. Real metrics pulled from Azure Monitor: CPU, disk, vCore usage, cost data.

Structured analysis posted to the incident

Three ranked root causes with confidence scores, numbered runbook steps, FinOps insights, and cost context, posted directly onto the PagerDuty incident within seconds.

What your team sees

This is the full analysis posted directly onto your PagerDuty incident. One note, everything your on-call needs.

Insight via Spring Agents · springagents.tech

summary

Disk space critical on PRODDATA02. F: drive at 94% (3.2 GB free on 50 GB). Log growth from /var/log consuming 18 GB with no rotation policy in place.

cloud metrics

CPU: 28%

Memory: 62%

Disk: 94%

Free: 3.2 GB

ranked root causes

#1 HIGH Unchecked log growth filling disk

/var/log consuming 18 GB with no rotation configured. Growth rate suggests full disk within 36 hours without intervention.

Confidence

91%

#2 Batch job wrote excessive temp data

Recent batch process may have dumped to /tmp. 5 GB in temp files detected.

Confidence

35%

#3 False positive, transient monitoring spike

Single threshold breach, disk usage stable in prior 24h.

Confidence

12%

recommended runbook steps

Check largest directories: du -sh /var/log/* | sort -rh | head -10
Review log rotation config: cat /etc/logrotate.d/*
Clear stale temp files: find /tmp -mtime +7 -delete

finops insight

Capacity exhaustion imminent: 3.2 GB free on 50 GB volume. Resizing to 100 GB costs ~$4/mo on Azure. Reactive expansion during an outage costs 2-3x more in engineer time and emergency change windows.

Spring AgaaS · Receipt: SA-7F2A1C

👍 Helpful 👎 Not helpful ✏️ Correct it

The intelligence loop

Every correction makes the next analysis smarter.

Incident arrives

AI generates 3 hypotheses

Engineer gives feedback

Accuracy improves

312 corrections applied

94.2% current model accuracy

Built for IT Operations teams

Not another dashboard. Cloud-aware analysis that works inside the incidents you already have.

Cloud-aware root causes

Real CPU, disk, memory, and vCore metrics pulled from Azure Monitor before hypotheses are generated. Not guessing from alert text alone.

CPU 28% Memory 62% Disk 94% Cost $312/mo

FinOps insights on every incident

Chronic under-provisioning, cost-of-inaction estimates, and right-sizing surfaced with dollar figures.

$4,800/mo savings flagged

Human in the loop

Thumbs, or a one-click correction. Your feedback trains the model for your environment.

Incident-type specialization

Purpose-built analysis per alert type. Each gets tailored metrics and runbook steps.

Azure SQL Disk capacity CPU saturation Memory pressure

Scannable structured notes

Summary, ranked causes, confidence scores, runbook steps. Designed to read at 3am.

30s read at 3am

Zero infrastructure changes

Connects via webhook. Posts notes through your existing PagerDuty key. Optional Azure credentials unlock real metrics.

0 things to install, host, patch, or maintain. Auth via API key only.

Incidents analyzed

Avg response time

AI accuracy rate

Est. hours saved

Built on PagerDuty's official webhook API

Connects in under 5 minutes. AI analysis posts directly onto your incidents as notes.

PagerDuty

Simple pricing

Pay for what you use. Upgrade or downgrade anytime.

Free

25 incidents / month

AI incident analysis
PagerDuty note posting
Usage dashboard

Get started

Starter

$49/mo

150 incidents / month

Everything in Free
Webhook signature verification
Email support

Get started

Growth

$149/mo

500 incidents / month

Everything in Starter
Priority support
Custom root cause categories

Get started

Enterprise

$399/mo

1,500 incidents / month

Everything in Growth
Dedicated support
SLA guarantee

Need unlimited? Contact us about Enterprise Plus.

HMAC-signed webhooks Encrypted in transit & at rest Rate-limited endpoints CSRF protection

Better context on every incident your team already responds to

Live infrastructure metrics, ranked hypotheses, and the cost context to justify fixes before the next page. Works inside PagerDuty. Nothing new to deploy. 25 free incidents to start.

Get started free