A buyer's decomposition

Robotic process automation with UiPath, decomposed into the three products you actually license.

UiPath calls itself RPA software. Mechanically, it is three separate products that you license, version-match, and operate together: Studio, the authoring IDE; Robot, the runtime that replays a recorded workflow on a Windows session; and Orchestrator, the central queue, scheduler, and credential store. This page walks the mechanical job each one does, where each one fails in production, and what an accessibility-API executor swaps in for each piece. The numbers come from an open-source Rust runtime you can read line by line.

Matthew Diakonov, Written with AI

Published April 30, 202611 min

Direct answer (verified 2026-04-30)

UiPath is the largest commercial RPA vendor. Its platform is three separately licensed products: UiPath Studio (the developer IDE for authoring workflows), UiPath Robot (the Windows runtime agent that executes them), and UiPath Orchestrator (the central server for queuing, scheduling, credentials, and audit). A workflow is a .xaml file authored in Studio, published as a NuGet package to Orchestrator, and replayed by a Robot against the recorded selectors. Verified against uipath.com/rpa/robotic-process-automation on 2026-04-30.

The shape of a UiPath deployment

Most explanations of UiPath conflate the three products. The marketing surface treats “RPA software” as one category. Inside an enterprise deployment, the three live on different machines, on different release schedules, and on different cost lines in the operations budget. A purchase order for a UiPath rollout names all three and calls out a separate line for the AI bolt-ons (Document Understanding, AI Center, the newer Agent Builder).

The flow looks like this: a developer authors a .xaml in Studio, publishes it as a NuGet package to Orchestrator, and Orchestrator schedules a Robot to pull and replay it. The diagram below is three nodes because the architecture is three nodes.

UiPath's three-product runtime

Studio

drag activities, pick selectors, save .xaml

Robot

replay the .xaml against a Windows session

Orchestrator

queue runs, hold creds, log audits

What happens in production: the four-stage workflow lifecycle

The textbook UiPath lifecycle has four steps. Three of them feel normal to an engineer who has shipped any kind of software: authoring, publishing, executing. The fourth, remediation, is the one that decides whether your RPA Center of Excellence is paying for itself. It is also the one most marketing pages skip past.

The lifecycle of a single workflow, end to end

1. Authoring in Studio

A certified developer opens UiPath Studio, drags activities (Click, Type Into, Read Range, Get Text) onto a canvas, and uses the recorder to capture the target element. Studio writes a .xaml file describing the workflow. Selectors are XPath-like expressions over UI elements: <wnd app='SAPGUI' cls='SAP_FRONTEND' /> or partial selectors that match by aaname. Studio is licensed per developer seat.

2. Publishing to Orchestrator

When the .xaml is ready, Studio publishes a NuGet package to Orchestrator. Orchestrator stores it, versions it, and binds it to a process. Orchestrator is the central server: a database, a queue, an asset store for credentials, and a scheduler. It runs on a license tier of its own (Standard, Enterprise, Cloud).

3. Execution by a Robot

A UiPath Robot is the runtime agent installed on a Windows machine, attended (next to a human) or unattended (on a dedicated VM). When Orchestrator triggers a process, the Robot pulls the package, instantiates the workflow, and walks the activities one at a time. Each Click activity asks the Windows UI Automation API to find the element matching the recorded selector.

4. Failure and remediation

If the selector misses (a label rename, a control id change, a panel reorder), the activity throws and the run fails. The failed run lands in Orchestrator's queue with a stack trace. A developer reopens the .xaml in Studio, re-runs the recorder against the new UI, fixes the selector, republishes, and the cycle restarts. This is the maintenance loop every UiPath buyer eventually pays for.

Why selectors break, and what that costs

The Robot is, mechanically, a function from (workflow.xaml, target Windows session) to a sequence of UI Automation calls. The function is deterministic by design. The non-deterministic part is the target session: applications change, labels rename, layouts shift, DPI scales differ, a SAP transport ships a small panel reorder, a banking core vendor pushes a quarterly UI patch. When the live UI no longer matches the recorded selector, the Robot does the only thing it can do, which is throw.

UiPath ships a UI Explorer to help a developer rebuild the selector after the throw, and the activity library has a “Use Application/Browser” wrapper that adds retries and waits. Neither changes the underlying matching strategy: there is one selector per activity and it must match. When it does not, the run fails. Multiply that across an enterprise with two hundred workflows and a quarterly application update cycle, and the maintenance backlog stops being a footnote and starts being a roadmap.

This is the single largest reason UiPath rollouts stall. It is not the licensing cost, which is bounded. It is the developer time it takes to keep two hundred selector-based workflows alive against a UI estate that updates without coordinating with you.

4 strategies

“The four-strategy match cascade in apps/desktop/src-tauri/src/focus_state.rs, lines 168 to 196, walks automation id, then window-plus-bounds, then visible text, then parent window. Most application updates are absorbed by one of the first three. A single selector miss does not fail the run; the architecture treats UI shift as expected, not exceptional.”

focus_state.rs, github.com/mediar-ai/terminator

What an accessibility-API executor swaps in for each piece

Once you decompose UiPath into Studio, Robot, Orchestrator, and the selector, each of the four pieces has a concrete replacement. The replacements are not bigger or fancier; they are different in shape. Studio (a canvas) is replaced by a recording session. Robot (a closed-source agent with a selector engine) is replaced by an open-source Rust crate that walks four match strategies. Orchestrator (a separately licensed server) is collapsed into a Postgres queue with a small API. The selector (one XPath per activity) is replaced by a four-strategy cascade.

The replacements below are not architectural sketches. Each one cites the file in the open-source Mediar repository (under MIT on github.com/mediar-ai/terminator) where the work happens.

Studio

The drag-and-drop authoring IDE plus selector recorder. Replaced by a recording session: a human runs the workflow once on Windows, the desktop recorder admits six event types (button_click, browser_click, text_input_completed, browser_tab_navigation, application_switch, file_opened) defined by is_meaningful_event_type in apps/desktop/src-tauri/src/recording_processor.rs at line 250. A Gemini Vertex AI pipeline converts that recording into a TypeScript workflow file. There is no canvas to drag onto.

Robot

The runtime that replays selectors. Replaced by an 871-line Rust crate at crates/executor/src/services/typescript_executor.rs that pulls the workflow from a queue and calls MCP execute_sequence against a Windows session. No model is in this hot path. The crate has zero references to gemini, claude, openai, or any inference library. Open source under MIT at github.com/mediar-ai/terminator.

Orchestrator

Central queue, scheduler, asset store, audit log, license server. Compressed into a Postgres-backed queue, a small set of API endpoints, and a TypeScript file in source control as the authoring artifact. SOC 2 Type II and HIPAA controls live with the cloud deployment, not in a separately licensed product.

The selector

Studio's recorded XPath-like expression. Replaced by a four-strategy match cascade in apps/desktop/src-tauri/src/focus_state.rs, lines 168 to 196. Strategy 1: automation/accessibility id. Strategy 2: window plus bounds. Strategy 3: visible text. Strategy 4: parent window focus. Most UI shifts are absorbed by one of the first three. A single selector miss does not fail the run.

The runtime, in code

The single uncopyable fact for a UiPath buyer evaluating an alternative is the size and shape of the production runtime. A UiPath Robot is a multi-megabyte Windows installer with selector engines for Win32, browser, Java, SAP, mainframe, and PDF. The Mediar production executor is a single Rust file, exactly 871 lines, at:

crates/executor/src/services/typescript_executor.rs

Its job, end to end, is: pull a workflow off a Postgres queue, build the MCP arguments (file URL, secrets, trace id), and call the MCP execute_sequence tool against a Windows session. The MCP server in turn wraps the Terminator SDK, which is the part that actually talks to Windows UI Automation, the same accessibility surface a screen reader consumes.

Grep the file for gemini, openai, claude, or anthropic: zero matches. The model is gone from the runtime by the time the executor runs. The intelligence is in the authoring layer that emits the TypeScript file in the first place. That split, AI at authoring time and plain Rust at runtime, is what most enterprise compliance teams want when they say they will not accept a model in the hot path.

The legacy applications this matters for

The four-strategy cascade and the accessibility-API approach were built for a specific class of applications: Windows desktop systems that emit accessibility trees but no APIs and no stable XPath selectors. Browser-based AI agents do not help here because the data does not live in a browser. UiPath does help, by virtue of having shipped Win32 and SAP and mainframe selector engines for over a decade, but the maintenance cost on these targets is also where UiPath rollouts stall the most.

SAP GUIOracle EBSJack Henry SilverLakeFiserv DNAFIS IBSEpic HyperspaceCerner PowerChartMainframe TN3270JD EdwardsSage 100QuickBooks DesktopSalesforce Classic

The buying decision, written down

For an organization with a mature UiPath Center of Excellence, stable workflows that have run unchanged for two years, and a roadmap that does not require new automation against new applications, UiPath is doing its job and a migration is churn. Keep it.

For an organization that has paid for UiPath and watched it stall, where the queue of automation requests is longer than the team that ships them, where new applications keep landing on the roadmap (a banking core upgrade, a new SAP module, a switch from Cerner to Epic), the decomposition above maps to an alternative architecture. Each piece of UiPath has a mechanical replacement. The replacement is not a feature checklist match; it is a different shape that costs less per workflow and adapts when the UI changes underneath.

The honest pitch is: keep UiPath running for the workflows it already runs, route new automation requests to an accessibility-API executor, and let the maintenance cost of each architecture make the long-term decision for you.

Bring a stalled UiPath workflow to a working call.

We will record the same target on Windows, run it through the four-stage authoring pipeline live, and replay the generated TypeScript file against your environment in the same call. No slides.

Frequently asked questions

What is UiPath, in one sentence, in the context of robotic process automation?

UiPath is the largest commercial RPA vendor; it sells a three-product platform (Studio for authoring, Robot for the runtime, Orchestrator for queuing and scheduling) that records a workflow against an application's UI elements and replays the recorded actions later through a Windows agent. The marketing page at uipath.com calls this 'robotic process automation', which is correct, but conflates the three products into one word.

What does each of the three UiPath products actually do?

Studio is the authoring IDE: a Visual Basic style canvas where a developer drags activities and saves a .xaml file. Robot is the runtime: a Windows service that loads the .xaml and asks Windows UI Automation to find the recorded elements. Orchestrator is the central server: a queue, a database, an asset vault for credentials, and a scheduler. Studio is licensed per developer seat, Robot is licensed per VM (attended or unattended), and Orchestrator runs on its own license tier. A working enterprise deployment buys all three.

Why is selector breakage the most common failure mode in UiPath?

Because the recorded selector is the only piece of evidence the Robot has to find the target element at replay. If the application ships an update that renames an aaname, repositions a panel, or changes a control id, the selector misses and the Click activity throws. UiPath's UI Explorer lets a developer rebuild the selector by hand, but the rebuild has to happen for every workflow that touched the changed surface. At an enterprise running 200+ workflows on SAP GUI or a banking core, the maintenance backlog is the silent cost of the architecture.

How is Mediar's executor different from UiPath's Robot?

Three differences. First, scope: Mediar's executor is one Rust file (crates/executor/src/services/typescript_executor.rs, 871 lines) plus the MCP server that wraps the Terminator SDK. UiPath's Robot ships as a Windows installer with selector engines for Win32, browser, Java, SAP, mainframe, and PDF, plus the OCR add-on. Second, the matching strategy: Mediar's runtime walks four strategies in order before pausing (focus_state.rs, lines 168 to 196), so a selector miss does not fail the run. Third, the source: terminator-rs is open source under MIT at github.com/mediar-ai/terminator. UiPath's runtime is closed source.

Where does AI live in a Mediar workflow, and where does it live in a UiPath workflow?

In Mediar, the AI is in authoring only. A Gemini Vertex AI pipeline reads the recording session, infers the eight semantic fields per step, and emits the TypeScript file. After generation finishes, the executor takes over and the model is gone from the loop. In UiPath, the AI lives in a separate product line called UiPath AI Center plus a newer 'Agent Builder', priced separately and bolted onto the existing Robot runtime. The base UiPath Robot is plain selector-based RPA the way it has been since 2014. UiPath has been moving toward 'agentic automation' branding since 2024, but the underlying Robot is still the same selector engine.

What does pricing look like, mechanically?

UiPath's public list pricing on the website starts around $420 per month per attended Robot license and rises into per-Orchestrator-tier and Studio-seat charges; large enterprise deals quoted at the Center of Excellence level commonly land in the $100K to $500K-plus range for the first year, with annual renewals. Mediar charges $0.75 per minute of executor runtime plus a $10,000 turn-key program fee that converts to credits. A workflow that runs ten minutes a day costs $1,575 a year in runtime. The math behind the 'twenty percent of UiPath' claim on the marketing site is that ratio at typical small-to-mid enterprise volumes; the savings shrink at very high concurrency and grow on long-tail workflows.

Should we keep our existing UiPath workflows or migrate?

If they are stable, keep them. The architecture pays for itself when a workflow runs for years against a frozen interface. The migration thesis applies to two specific patterns: workflows that have stalled in development for more than three months because the selectors keep breaking, and workflow categories that need to be extended to new applications faster than your RPA Center of Excellence can hire developers to maintain them. We see the second pattern most often at mid-market banks on Jack Henry, Fiserv, or FIS, where the queue of automation requests is much longer than the team that ships them.

Does Mediar work with UiPath's queue or activities ecosystem?

No. The two systems do not share workflow files or selectors. UiPath uses .xaml plus a NuGet package; Mediar uses a TypeScript file in git plus a Postgres-backed queue. A migration is a re-recording of each workflow against the same target application. The re-recording is fast (a human walks the path once and the AI authoring layer emits the TypeScript), but it is a re-recording, not a translation. Teams that want a phased move usually keep UiPath running for stable workflows and onboard new automation requests directly onto Mediar.

Is Mediar a UiPath alternative or an AI agent?

A UiPath alternative at the runtime layer, with an AI authoring layer on top. The runtime is deterministic Rust calling Windows accessibility APIs through MCP. The authoring is an LLM call that runs once per workflow. Most enterprise compliance teams reject 'fully autonomous AI agents' that call a model on every step at runtime because audit and reliability behave badly when the action sequence is non-deterministic. Mediar splits the responsibilities so the authoring is intelligent and the runtime is predictable. UiPath has been making the same split with the AI Center / Agent Builder bolt-ons; the difference is where the boundary sits.

Adjacent guides on the same topic

Keep reading

Architecture

What robotic process automation is, in three layers you can open in source

A six-event capture filter, a four-stage synthesis pipeline, and a four-strategy replay cascade. Every working RPA system has all three; the differences between vendors live inside one of them.

Read

Etymology

Meaning of robotic process automation: how the term split into two architectures

Blue Prism coined the phrase in 2003 to describe a Windows scripting runtime. In 2026 the same three words point at two incompatible architectures. A word-by-word reading of the term.

Read

Company

Mediar: the company, the founders, and the open-source SDK

Background on the Y Combinator backed company at mediar.ai, the open-source Terminator SDK on github.com/mediar-ai/terminator, and how the open-source pieces fit into the commercial product.

Read