Automat (YC), explained
What Automat (YC) is, and the one thing its pages never compare: who owns the engine
Direct answer · verified 2026-06-18
Automat is a Winter 2023 Y Combinator company building AI agents for robotic process automation. It was founded by Lucas Ochoa (CEO) and Gautam Bose, turns a screen recording or a written description into an automation using a transformer-based RPA engine plus computer vision, and raised a $15.5M Series A in November 2025, led by Felicis. Its product lives at runautomat.com.
Sources: ycombinator.com/companies/automat and runautomat.com, checked 2026-06-18. The rest of this page is about the one axis those pages do not cover: whether the automation engine is something a vendor operates for you, or something you can read and run yourself.
What Automat actually does
Automat is one of the stronger entries in the new wave of AI-native RPA companies. The original YC launch described it plainly: robotic process automation for Chrome using ChatGPT and computer vision. Since then it has broadened. Today it pairs UI agents (a proprietary transformer-based RPA engine that mimics how a person uses an interface) with API-driven automations and intelligent document processing, and it says it runs across Windows apps, Citrix, web portals, and legacy systems without API access.
The way you adopt it matters as much as the technology. Automat's own three-step process is: you upload a screen recording or process docs, then Automat's team and AI agents build, validate, and test the automation, then it deploys via hosted cloud, on-prem, or a custom API. The company hires forward deployed engineers and sells a done-with-you model. That is a deliberate choice, and for some buyers it is exactly the right one. It is also the thing that separates it most from how Mediar is built.
Two ways to own an automation
Strip away the marketing and there are two structurally different answers to the question “where does the automation live, and who can change it?” Automat answers it one way. Mediar answers it the other. Neither is wrong; they optimize for different teams.
Vendor-built, forward-deployed
Automat's model
- You hand over a recording; the vendor's team and AI build the bot.
- The engine that runs it is proprietary and operated by Automat.
- Computer vision and Vision Transformer models recognize the fields.
- Good when you want a partner to own delivery end to end.
Self-serve, open-source engine
Mediar's model
- You record the workflow once, or write it as plain-text steps.
- The execution engine is open source under MIT; you can read it.
- Controls resolve by role and name in the accessibility tree, not pixels.
- Good when you want to audit, version, and extend the engine yourself.
The anchor fact: you can read Mediar's engine before you trust it
Here is the part you can verify yourself, today, without a sales call. A Mediar workflow is not a black box that a vendor operates on your behalf. It is a plain-text file: a list of steps, each with a tool, a target, and its own error strategy. The same engine that runs it, the Terminator SDK, is open source under the MIT license and primarily written in Rust, at github.com/mediar-ai/terminator.
Notice the selector lines. They do not say “the box at x=412, y=290” and they do not hand a screenshot to a vision model. They name the control: role:Edit|name:Vendor Code. That is a lookup against the Windows accessibility tree, the same structured list of controls every well-behaved app publishes for screen readers. The on_error field is equally concrete: each step is stop, continue, retry, or fallback, so a failed step does something you specified rather than something a hosted runtime decided for you.
And because the lookup is by role and name, you can confirm a given app is automatable before you write a single step. Run the engine's inspector against the window and it prints the controls it can see, sourced from the accessibility tree rather than a picture of the screen:
This is the difference that does not appear on any page ranking for Automat. Their pages tell you what the product does and who funds it. None of them let you read the engine and decide whether your own legacy screen exposes the fields by name. With an open-source engine, that is a five-minute check, not a procurement question.
Why the targeting layer is worth arguing about
The reason any of this matters is that the expensive, stuck workflows in most enterprises sit on no-API desktop software, and that is exactly where recognizing a field from its rendered image is the hardest. When the targeting layer holds, the savings are real and the workflow is named, not a press-release rounding.
“We moved an F&B chain running SAP Business One off UiPath. Their CFO told the board they are now saving 70 percent on costs.”
Mediar deployment, F&B chain on SAP B1
For the underlying mechanism, see the pixels-versus-accessibility-tree breakdown (written about another vision-based RPA company, Sola, but the engine argument is the same), and why no-API legacy desktop apps are the real moat.
Where Automat's model is the better fit
If you want a funded partner to own automation delivery, you have the budget for a forward-deployed engagement, and your workflows live on surfaces a vision engine handles cleanly, Automat is a credible choice, and its $15.5M Series A says the market agrees that AI agents are eating legacy RPA. The honest dividing line is not which company is better in the abstract.
It is two questions about your own situation. Do you want to own and extend the engine yourself, or have it operated for you? And do the systems you are automating publish a usable accessibility tree, or do they only exist as pixels on a screen? If your answers are “own it” and “the data is in SAP GUI, a mainframe terminal, Jack Henry, Fiserv, FIS, Epic, or Cerner,” the open-source, tree-reading approach is usually the one that survives in production.
Bring the workflow you would have handed to a forward-deployed team
Show us the legacy screen on the call and we will tell you whether the accessibility tree can drive it, and you can read the engine that would, before you commit to anything.
Questions people ask about Automat (YC)
What is Automat, the Y Combinator company?
Automat is a Winter 2023 Y Combinator company building AI agents for robotic process automation. It was founded by Lucas Ochoa (CEO) and Gautam Bose, who met at Carnegie Mellon's design school and previously worked on LaMDA prototypes at Google. The product turns a screen recording or a written description of a workflow into an automation using a transformer-based RPA engine and computer vision. It is at runautomat.com.
How much has Automat raised?
Automat raised a $15.5M Series A announced in November 2025, led by Felicis with participation from Initialized, Khosla Ventures, Y Combinator, K5 Global, and Input Capital. Multiple outlets put the company's total capital raised at roughly $19.25M including the earlier seed. The round was earmarked for migrating enterprises off legacy RPA and growing the product, engineering, and sales teams.
How is Mediar different from Automat?
Two ways. First, the build model: Automat's three-step pitch is upload a recording, Automat's team and AI agents construct and test the automation, then deploy. It is a vendor-built, forward-deployed service running a proprietary engine. Mediar ships you the tool to record and run workflows yourself, and the execution engine, the Terminator SDK, is open source under MIT at github.com/mediar-ai/terminator. Second, the targeting: Automat recognizes fields with computer vision and Vision Transformer models, while Mediar resolves them by role and name in the Windows accessibility tree.
Is Mediar's automation engine really open source?
Yes. The Terminator SDK that reads and drives the accessibility tree is MIT-licensed and primarily written in Rust, with TypeScript, JavaScript, and Python bindings, at github.com/mediar-ai/terminator. You can read exactly how a control is resolved (by selectors like role:Button and name:Post) and extend it for custom workflows. Automat's transformer RPA engine is proprietary and operated by Automat.
Does Automat work on legacy desktop apps like SAP or mainframes?
Automat states it runs across Windows applications, Citrix, web portals, and legacy systems without API access, using its UI agents and computer vision. The honest dividing line is not the brand but the surface: on dense legacy desktop UIs (SAP GUI, 3270 green-screens, banking cores) recognizing a field from its rendered image is the harder bet, while reading the accessibility tree the app already publishes to screen readers is usually the more stable one.
When is a vendor-built service like Automat the right call?
If you want a partner to own the automation end to end, you have budget for a forward-deployed model, and your workflows sit on surfaces a vision engine handles well, that model is reasonable. If you want to record and run automations yourself, keep the workflow as readable text, and have an engine you can audit and extend, the open-source approach fits better. Both are valid; they optimize for different things.
Keep reading
Sola RPA: computer vision over pixels vs the accessibility tree
Another AI-native RPA company, and the one mechanism that decides where a vision bot holds up and where reading the tree is the only thing that works.
What CloudCruise is, as a company
A profile of another automation startup in this space, and where the desktop layer changes the comparison.
How RPA automation actually runs at the engine level
What an automation is as data: a sequence of steps, each with its own error strategy, driven through the accessibility tree.
Comments (••)
Leave a comment to see what others are saying.Public and anonymous. No signup.