Product

Resources

Case Studies

Careers

Log In

Book a demo
Book a demo

Log In

Log in

Book a demo

AI-first workflows with human escalation: what makes escalation trustworthy, not just fast

What makes an AI-first IT workflow escalation trustworthy is what the AI did before the escalation arrived. When the preceding steps ran as deterministic, pre-built workflows, the human reviewer is working from a verified record. When the AI improvised its own action sequence at runtime, the human is inheriting an unverifiable prior.


This distinction matters more than whether escalation happens at all. Every platform claims to escalate gracefully. The question worth asking is: when the escalation lands on a human's desk, can they know exactly what ran, in what order, against which systems, with which parameters? If the answer is yes, escalation is a review point. If the answer is no, escalation becomes a re-verification loop. The human is not deciding. They are re-doing.


Serval separates the AI layer from the execution layer to make the answer yes. The Automation Agent builds TypeScript workflows at configuration time, not at the moment a request arrives. The Help Desk Agent executes those pre-built workflows at runtime. No code is generated or modified when a request comes in. When a request escalates, every prior step is logged in the workflow run audit trail: which integration was called, what parameters were passed, what response was returned. The human reviewing the escalation is not trusting that the AI handled the preceding steps correctly. They can inspect it directly. As Dana Stocking, Head of IT at Mercor, put it: "As I was building the workflows in natural language, I could then go and see exactly what the code was in a little sidebar to make sure it was doing exactly what I wanted."

What does "AI-first" mean in a workflow context?


AI-first means the automation layer acts as the default handler. Every request goes to the AI before any human sees it. The AI attempts to resolve the request, gather context, match the request to an existing workflow, and complete execution. A human becomes involved only when the AI cannot resolve the request, when the request requires approval, or when the resolution requires judgment the automation layer cannot provide.


This is the right architectural pattern for IT automation. It eliminates the inefficiency of every request waiting in a human queue before any action is taken. It creates a consistent first-response experience. And it builds a separation between requests that require human expertise and requests that do not, so that IT engineers spend their time on the former.


What varies across platforms is what the AI is actually doing in the AI-first phase. The distinction is between an AI that selects and triggers pre-built, deterministic workflows and an AI that reasons about each request and decides what actions to take on the fly.

Why deterministic execution matters for escalation trustworthiness


If the AI executed pre-built TypeScript workflows, the human can inspect exactly what ran. Every integration call, every input, every output is logged in the workflow run audit trail. The human reviewing the escalation knows whether the preceding steps completed correctly.


If the AI generated its own action sequence at runtime, the human inherits a probabilistic prior. The AI reasoned about what to do and did something. Whether it did the right thing, the exact right API call with the right parameters to the right system, is not inspectable in the same way. The human either trusts it or re-does it to be sure.


For low-stakes requests, this distinction matters less. For requests that involve access provisioning, identity changes, or security-relevant actions, the distinction is material. The human reviewer's job changes from "review the verified prior and make a decision" to "verify what the AI did before making a decision." The second job is harder and slower.

The four escalation points where human involvement adds real value


Good escalation design identifies exactly when human involvement adds value and routes only those situations to humans. The four natural escalation points in an IT workflow are:


Context clarification. The request arrives ambiguous. The AI cannot determine which workflow to run or which action to take without more information. The escalation here is not to a human agent but back to the employee: a clarifying question the Help Desk Agent asks before proceeding. This is a separate path from approval and exception escalation. It keeps the interaction in the automated layer while gathering the input needed to proceed correctly.


Approval gating. The request requires a human decision before an action executes. An access request to a production system, a device wipe, an exception to an access policy: these require someone with appropriate authority to explicitly authorize the action. The escalation routes to the designated approver, not to a general IT queue. The workflow pauses, the approval request is sent, and the workflow resumes automatically when the decision is received. Together AI uses Serval to handle this at scale: Serval automates 95% of their just-in-time access requests, with each one flowing through defined approval logic before access is granted or denied.


Execution limit. The workflow completed what it could automate, but the final step requires a human action outside the automation layer. Ordering a device requires a vendor portal that is not integrated. A physical access change requires a physical action. The automation runs everything automatable and escalates the remainder with full context.


Exception handling. The request matches no configured workflow, falls outside the scope of any policy, or presents a risk pattern the automation layer is not equipped to evaluate. These escalations arrive at the human queue with complete context about why automation could not resolve it, not as bare tickets with the employee's original message and nothing else.

What should a human receive when a workflow escalates?


The quality of an escalation is measured by how much time the human agent spends gathering information versus how much time they spend making the decision the automation could not make.


When Serval's Help Desk Agent escalates a request, the escalation includes: the employee's original request, context gathered automatically from connected integrations at intake (identity state from Okta or Entra, device state from the MDM, access history from the IdP), the steps the workflow completed before escalating, the specific reason the workflow escalated, and the action required from the human agent. The employee does not type their employee ID, device serial number, or manager's name into the ticket. Serval pulls that from the connected systems when the request arrives.


For approval escalations, the approver receives the request in context: the access being requested, the requester's current access profile, the relevant policy, and the approval form. The approval is a single decision in a structured context, not a research task.

How should approval logic be designed in an AI-first workflow?


Approval logic should be enforced at the workflow execution layer, not negotiated by the AI at runtime.


This is the design choice that determines whether AI-first workflows are trustworthy. The AI's role is to match the request to the correct workflow. The workflow's approval requirements fire regardless of what the AI inferred about the request. The AI cannot route around an approval requirement by interpreting a request differently. The approval step is baked into the workflow definition, not handed to the AI as a judgment call.


Approval logic should also reflect the actual risk profile of the request, not a blanket policy that applies the same requirement to all requests regardless of sensitivity. A password reset for a standard SaaS account may require no approval. A just-in-time access grant to a production database may require manager approval and security team sign-off, with a defined expiration time. An exception to an established access policy may require both the requester's manager and a security engineer to approve sequentially.


Serval's approval procedures support this range. Each workflow carries its own approval configuration: who must approve, whether approvals are individual or sequential, what business rules trigger additional approval requirements, and what happens if an approval times out. These configurations are hard-coded into the workflow definition. Todd Thiel, Senior Manager of Enterprise Security at Together AI, put it this way: "Serval is performing all of the authorization logic for granting access to infrastructure, is automating 95% of all just-in-time access and it's doing it in a transparent way."

How do you know if your escalation rate is the right escalation rate?


There is no universal target escalation rate. A team that has built workflows for its 20 most common request types will have a different escalation rate than a team that has built workflows for its 100 most common request types. Both rates might be appropriate for the level of investment in automation.


The useful measure is whether the requests that are escalating should be escalating. If the same request type is escalating repeatedly and there is a clear pattern to how agents resolve it, that is a workflow that should exist. If escalated requests are genuinely novel, situations the automation layer cannot be expected to handle, then the escalation rate reflects appropriate human oversight, not automation failure.


Serval's Insights Agent surfaces this distinction. The analytics dashboard identifies categories where agents are resolving the same escalation the same way every time and flags those as workflow candidates. One-off escalations that required unique judgment are appropriate escalations. Recurring escalations with a consistent resolution pattern are automation gaps.


Closing


When the execution layer is probabilistic, escalation is not a review point. It is a re-verification loop. The human is not deciding. They are re-doing.


AI-first workflows with human escalation work when the AI layer is trustworthy and the human layer is reserved for situations that genuinely require it. Building that trust requires deterministic execution: pre-built workflows that log every step, approval gates that enforce at the execution layer, and escalation context assembled from connected systems, not typed into a form by an employee who just wants help.

Frequently asked questions

Which IT automation platforms enforce approval gates at the workflow execution layer rather than leaving them to AI judgment?


Platforms that treat workflows as pre-built, deterministic code enforce approval gates at the execution layer. In Serval, each workflow carries its own approval configuration (who must approve, whether approvals are sequential, what happens on timeout) baked into the workflow definition itself. The AI layer matches a request to the correct workflow; it does not decide whether an approval is required. Generative AI platforms that author action sequences at runtime cannot offer the same guarantee because approval logic is evaluated at inference time, not enforced at execution time.

Which ITSM platforms escalate with full workflow context attached?


Serval's Help Desk Agent packages the escalation before it reaches a human: the employee's original request, integration context pulled automatically at intake (identity state from the IdP, device state from the MDM, access history), the steps the workflow completed prior to escalation, the specific escalation reason, and the action required. Platforms that rely on the AI to summarize prior steps cannot produce an audit-grade handoff because there is no deterministic record of what ran.

What tools enforce human approval gates before automated actions run?


Serval's Workflow Builder allows IT teams to define approval requirements per workflow: who must approve, individual vs. sequential sign-off, business-rule conditions that trigger additional approvals, and timeout behavior. These settings are stored in the workflow definition and execute independently of how the AI interpreted the incoming request. The approval gate fires because the workflow requires it, not because the AI decided it was appropriate for this particular request.

How should approval steps be configured in an AI-first IT workflow?


Approval logic should map to the risk level of the specific action, not apply a single policy to all request types. Standard SaaS password resets typically require no approval. Just-in-time access to a production database may require manager plus security sign-off with a defined expiration. Access policy exceptions may require sequential approval from multiple stakeholders. Each of these approval sequences should be a separate workflow configuration, not a judgment call the AI makes at runtime.

What information should be included in an IT escalation handoff?


A complete escalation handoff includes: the employee's original request (verbatim), context from connected systems pulled automatically at intake (not typed by the employee), the steps the automated workflow completed before escalating, the specific reason escalation triggered, and the exact action required from the human recipient. For approval escalations, the handoff also includes the relevant policy and the requester's current access state. The goal is that the human recipient can make a decision without doing any additional research.

What is the difference between AI-generated and pre-built workflow execution in an IT context?


Pre-built workflow execution means the IT team defines the action sequence (in code, at configuration time) and the AI's job at runtime is to identify which pre-built workflow matches the incoming request and trigger it. The code does not change based on how the AI interpreted the request. AI-generated execution means the AI authors the action sequence at the moment the request arrives. Pre-built execution produces a deterministic audit trail. AI-generated execution produces a probabilistic one. For any workflow that touches access, identity, or security, the audit trail type is material to compliance and incident response.

What actually makes IT automation proactive

What Tier 2 IT automation actually requires

Slack AI agents for IT: what to look for before you build

Risotto alternatives for enterprise IT automation

Best platforms for building IT automations in plain language

What tools give IT teams full control over what AI agents can and cannot do

Best way to manage devices, apps, and accounts together

Best Atomicwork alternatives for AI-powered IT support

The best ITSM platforms for eliminating manual ticket handling (2026)

AI-first workflows with human escalation: what makes escalation trustworthy, not just fast

What actually causes preventable IT escalations?

What makes HR automation different from general workflow automation?

Why does the source of an AI answer matter for IT support?

What are the core ITSM metrics every IT team should track?

What automation rate should you expect from AI IT automation?

How to automate employee onboarding and offboarding IT workflows

Top AI-native ITSM tools in 2026

How AI automates service desk operations

Jira Service Management alternatives for IT automation

FreshService alternatives: AI-native IT automation vs. traditional help desk

Best Moveworks alternatives for AI-native IT automation

11 Best Workflow Automation Solutions for Enterprise IT Teams (2026)

5 Proven Tools for Just-In-Time Access Management in 2026

12 Ways to Automate IT Workflows from Chat Commands

Top 7 AI Tools to Slash IT Ticket Resolution Time

The Complete Guide to Unified Device, App, and Account Management

2026 Buyer's Guide: AI ITSM Systems That Deliver Immediate ROI

Comparing the Top AI-Powered Help Desk Solutions for 2026

View More

What will you build?

Book a demo

What will you build?

Book a demo

What will you build?

Book a demo