AgentOps Forensics · The decision

Should you self-host Hermes Agent, or move to managed?

First, both options have to clear your security, compliance, and reliability requirements. After that it is economics: compare your cost per useful run, computed from your own logs, against a managed price you were actually quoted. If self-hosting keeps a meaningful margin after retries and your time, keep it; if not, managed is usually the safer default. Monthly totals alone are not enough.

Run the free self-check the cost model behind it

✓ A framework, not a sales pitch ✓ Read from your own numbers ✓ Keep, migrate, or retire

Before the math

What has to be true before cost even matters?

The cost-per-useful-run rule: keep self-hosting only if your cost per useful run, measured from your own logs over a 30-to-60-day window that includes your spikes, stays below a managed price you were actually quoted by enough margin to pay for the operational work. If the margin is thin or negative, migrate. If neither option fits the workload anymore, retire the setup.

Cost only decides between options that already work. Before comparing a single number, check that each option clears your hard requirements. If one fails, it is out regardless of price.

Security and compliance

Where the data and the model may run, what has to be auditable, and which certifications are non-negotiable.

Latency and reliability

The response time and uptime your product needs, and whether each option holds them under load.

iii

Model and tool parity

Whether both options run the model, tools, and context you actually depend on, not a near-equivalent.

Ownership and control

Who can change the system, how fast, and whether you can carry the operational work either way.

The decision flow: calculate your run economics (the cost breakdown), map the fixed components (the per-component reference), then make the keep, migrate, or retire call on this page.

When it wins

When is self-hosting Hermes Agent cheaper than managed?

When your cost per useful run sits below a quoted managed price per run, and stays there after you count retries and your own time. That is the comparison that decides among options that passed the four feasibility gates, not monthly bill against monthly bill.

A useful run is one that produced an acceptable result, not one that errored or had to be redone. Measure cost per useful run over a trailing window that includes your spikes, not a quiet week.

Self-hosting tends to win at high, steady volume with a stable set of tools: you amortise the fixed infrastructure across many runs and you control the token overhead directly.

Managed tends to win at low or spiky volume, for small teams without spare operations capacity, or early on when the workload is still changing shape and pinning down infrastructure is premature.

The per-run figure comes from your logs, not a vendor table. The free self-check walks the same model.

Self-hosted vs managed Hermes Agent: which conditions favor which. A decision aid, not a benchmark; the prices come from your logs and a real quote.
Factor	Favors self-hosting	Favors managed
Run volume	high and steady (fixed infrastructure amortises)	low or spiky (idle infrastructure is expensive per run)
Tool set and workload	stable catalogue, settled architecture	still changing shape week to week
Operations capacity	spare capacity for reliability work and on-call	small team with none to spare
The deciding number	cost per useful run beats a real managed quote with margin for the ops work	the gap is thin or negative once retries and your time are counted

The thresholds behind this table are the cost-per-useful-run rule near the top of this page; the math lives in the Hermes Agent cost breakdown.

The true cost

What does self-hosting Hermes Agent actually cost beyond tokens?

Tokens are the visible cost. The comparison that surprises people is the rest of it:

Infrastructure divided by runs. The monthly server bill only looks cheap until volume drops; per run, idle infrastructure is expensive.

Retries and fallbacks. A retry, or a fallback path, often costs close to a full run, re-paying the fixed overhead. A flaky tool or a drifting retrieval step turns a reliability problem into a cost problem (the bill-creep drivers).

Your time. Reliability work and on-call are real costs, often the largest of the ones left out of the spreadsheet.

The honest cost of self-hosting is tokens, plus infrastructure per run, plus retries, plus your time. Compare that to a managed quote.

The decision

Keep, migrate, or retire: how do I decide?

Three outcomes, each with the condition that points to it, what it costs you, and the signal in your own numbers.

Keep self-hosting

Your cost per useful run beats a quoted managed price by a margin wide enough to pay for the operational work, and reliability is under control.

The tradeYou carry infrastructure, reliability work, and on-call, but the per-run economics and the control are worth it.

The signalHigh, steady volume with a stable set of tools, and a cost-per-useful-run gap that holds after you count retries and your time.

Migrate to managed

The cost gap is thin or negative once you count the hidden costs, or the operational burden is pulling you off the product.

The tradeA higher visible per-run price, traded for less infrastructure, no on-call, and time back.

The signalLow or spiky volume, a small team without spare operations capacity, or a workload still changing shape week to week.

iii

Retire the setup

Neither self-hosting nor your current managed plan fits any more, usually because the workload changed underneath the architecture.

The tradeThe cost of a rebuild now, against the compounding cost of paying for a setup that no longer matches the work.

The signalCost per useful run drifting up for reasons no lever fixes, or an architecture bent around a workload that no longer exists.

This framework is fully usable for free. If you want a second opinion, a written Verdict applies the same rubric to your logs and your quote and returns one call: keep, migrate, or retire.

Questions

Self-host vs managed Hermes Agent: frequently asked

What has to be true before cost even matters in this decision?+

Both options have to clear your hard requirements first: security and compliance, data residency, the latency and reliability your product needs, and parity on the model, tools, and context you depend on. If one option fails a hard requirement, cost per run is irrelevant, that option is out regardless of price. Run the cost comparison only on the options that pass.

How long should I measure before deciding to keep, migrate, or retire?+

Long enough to include your spikes, not a quiet week. A trailing window of roughly 30 to 60 days is usually enough, counting only useful runs (runs that produced an acceptable result, not ones that errored or had to be redone). A calm window flatters self-hosting; a spike flatters managed, so measure across both.

When should I migrate from self-hosted Hermes to managed?+

When the cost-per-useful-run gap is thin or negative after you count the hidden costs, or when the operational burden is pulling you off your product. Spiky or low volume and small teams without spare operations capacity usually point to managed.

What is the keep, migrate, or retire decision for Hermes Agent?+

A three-way call read from your numbers, after both options clear your hard requirements: keep self-hosting if the per-run economics and control justify the operational work; migrate to managed if the gap is thin or operations are a drag; retire the current setup if the workload changed and neither option fits, which means rebuild or drop it. A written Verdict applies the same rubric to your logs.

Is self-hosting Hermes Agent worth it?+

Only when your measured cost per useful run beats a real managed quote by enough margin to pay for the operational work. High, steady volume with a stable set of tools is the case most likely to make self-hosting worth it. If the margin is thin or negative once retries and your time are counted, managed is the safer call.

Who writes this

byJed, building and operating production software since 2007 and running self-hosted Hermes workflows since February 2026. The decision framework is given away here; a paid Verdict applies the keep, migrate, or retire tests to your specific numbers, read from your logs.

Issue #4379 is cited as an input and an example, not a benchmark baseline. This page is a decision framework, not a benchmark; managed prices vary, so compare against a quote you were actually given.

Run the free self-check~5 min · no signup · in your browser Start