Platform investment analysis

Agentic Service Ticket Resolver — build once vs. configure per use case
Build once, never touch
23 components
Configure per new use case
7 items
Self-learns from every ticket
7 systems
ComponentInvestmentNew use case impactEffort per case
Platform engine — build once, applies to every use case forever
State machine + orchestrator
10 states, 4 phases, run lifecycle
one-time buildNo change. Same engine for any ticket, service, or domain.Zero
Policy guard (28 gates)
Authority, safety, budget, approval
one-time buildNo change. Gate logic is universal. Thresholds are config.Zero
Agent runtime + THINK step
ReAct loop, evidence chain, budgets
one-time buildNo change. Executes any agent role for any problem.Zero
MCP gateway + tool execution
Dispatch, sanitize, audit, idempotency
one-time buildNo change. Runs any registered tool.Zero
Content pipeline + retrieval
Ingest, chunk, embed, hybrid search, rerank
one-time buildNo change. Processes any document, searches any content.Zero
Knowledge creator + feedback loop
Episode, KB draft, skill draft, graph update
one-time buildNo change. Learns from any resolved ticket.Zero
UI, streaming, observability
Live feed, audit, tracing, metrics
one-time buildNo change. Renders any event from any investigation.Zero
Crash recovery + caching
Watchdog, heartbeat, 9 cache layers
one-time buildNo change.Zero
Data sanitizer (PII/credentials)
Pre-LLM scanning and redaction
one-time buildNo change. Patterns are universal.Zero
Zero-knowledge strategy
Cold start, architecture discovery
one-time buildNo change. Works for any unknown service automatically.Zero
Per-use-case configuration — add a new service in under 2 hours, no code
Service connection registry
How to connect: host, auth, method
configureOne database row per service: connection type, host, auth reference, allowed commands, log locations.15 – 30 min
Command templates
Diagnostic + remediation commands
configureJSON entries: command ID, parameterized template, risk level. Agent never writes raw bash.15 – 60 min
Graph seed data
Service, team, dependencies
configureCSV upload or API call. After first ticket resolves, graph grows automatically.15 – 30 min
KB documents (runbooks)
Troubleshooting guides, SOPs
configureImport from Confluence/wiki or write new. Optional day 1 — system works without them.0 – 4 hr
Skills (SKILL.md)
Procedural investigation guidance
configureWrite or review auto-generated drafts. Optional day 1 — system runs skillless.0 – 8 hr
Policy overrides
Thresholds, approvals, time windows
configureYAML or DB row per tenant/service. Optional — defaults work.0 – 10 min
Metric baselines
Normal CPU, memory, connections
configureDB row per service + metric. Can auto-learn from monitoring history.0 – 10 min
Self-learning — improves with every resolved ticket, zero ongoing effort
Episodic memory
Past incident records
auto-learnsEvery resolution becomes a reference for future agents. Dead ends are remembered.Zero
Graph failure patterns
Failure frequency, fix success rates
auto-learnsAgent tries the most common failure first. Fix success rates guide resolution.Zero
KB + skill auto-drafts
Runbooks and skills from resolutions
auto-learnsSystem drafts new docs after every skillless run. Human reviews and publishes.15 min review
Quality scores
KB article effectiveness tracking
auto-learnsHelpful articles get boosted. Misleading articles get penalized and flagged.Zero
Confidence calibration
Per-service threshold tuning
auto-learnsAfter 5+ runs, thresholds auto-adjust based on actual accuracy per service.Zero
Repository profiles
Code structure maps
auto-learnsAuto-generated from GitHub repo scanning. Improves code analysis accuracy.Zero
Investigation patterns
Optimal diagnostic sequences
auto-learnsWeekly analysis discovers recurring patterns. Suggests new skills automatically.Zero
Bottom line for business: The platform is a one-time engineering investment. Adding a new service requires configuration only — no developers, no deployments, no code changes. A new service can be onboarded in 45 minutes (minimum) to 2 hours (with full setup). After the first few tickets resolve, the system teaches itself — auto-generating runbooks, skills, and failure patterns that make every future run faster and cheaper. Moving to an entirely new technology domain (e.g., from MuleSoft/SAP to Kubernetes/AWS) requires zero platform code changes — only new configuration and knowledge content.