Connecting AI Agents to Legacy Manufacturing Systems
How to connect AI agents to legacy systems — green-screen ERP, no API, dirty data. A plant operator's playbook for getting value without a rip-and-replace.
Every vendor demo of AI agents legacy systems integration assumes you're running something modern with a clean REST API. Your plant isn't. You're running a 1990s ERP with a green-screen front end, a homegrown scheduling tool a guy named Dave wrote in 2008, and an MES that exports to CSV when it feels like it. I ran AI at a $250M furniture manufacturer with exactly this kind of stack, and the good news is you do not need to replace any of it to get value. You need to meet it where it is.
The instinct from IT is often "we can't do AI until we modernize the ERP." That's a five-year, seven-figure project, and it's the wrong order. Agents can wrap legacy systems today. The question is which wrapping method fits your specific constraint, and how to deal with the data that comes out the other side.
Find your access door first
Before you scope a single agent, answer one question: how can software read this system without a human typing? There are four doors, and most legacy environments have at least one open.
| Access door | What it needs | Reliability | Use when |
|---|---|---|---|
| Database read access | Read credentials to the underlying DB | High | The ERP sits on SQL Server/Oracle/DB2 you can query |
| File/EDI exports | Scheduled CSV, flat-file, or EDI drops | High | The system already exports on a schedule |
| Legacy API / SOAP | An old but real web service | Medium | Vendor shipped SOAP/XML endpoints |
| Screen scraping / RPA | Terminal or UI automation | Low | None of the above exists |
Most "impossible" legacy ERPs are sitting on a perfectly queryable database — the green screen is just a front end. If your IT team can hand you read-only DB credentials, you're 80% of the way to a working agent, no modernization required. We ran several agents this way: the front end stayed exactly as ancient as before, and the agent read straight from the tables underneath.
If the database is locked down or genuinely inaccessible, fall back to whatever the system already exports. Manufacturing systems are export machines — nightly batch files, EDI 856s, scheduled reports. An agent that ingests the same file your accounting team already gets is boring, reliable, and ships in a week.
The data lake pattern beats live integration
For legacy systems specifically, don't make the agent query the old system in real time. Stage the data instead.
The pattern: pull from the legacy system on a schedule into a modern staging layer — a cloud database, a data warehouse, even a well-organized set of tables. The agent reads the staging layer, never the legacy system directly. This buys you three things:
- You don't load the old box. A 20-year-old ERP doesn't appreciate an agent hammering it with queries. The staging layer absorbs that.
- You clean once, read many. Transformation and cleanup happen in the pipeline, so the agent sees consistent data instead of legacy quirks.
- You decouple the agent from the legacy system's quirks. When you eventually modernize, you swap the source feeding the staging layer and the agent never notices.
Latency is the trade. A nightly or hourly refresh means the agent isn't truly live. For 90% of legacy use cases — supplier-doc lookup, order-status Q&A, planning questions, ops-review prep — yesterday's data or last-hour's data is completely fine. Reserve real-time integration for the rare case that genuinely needs it.
Your data is dirty. Plan for it.
The real obstacle with legacy systems isn't access. It's data quality. Legacy systems accumulate decades of inconsistency: three spellings of the same supplier, units that switch between each and case mid-table, part numbers with trailing spaces, free-text fields where structure should be.
An agent fed this data confidently produces wrong answers. That kills trust faster than anything, and a single bad answer in front of a skeptical plant manager can end the whole initiative.
Deal with it in the pipeline, not in the prompt:
- Normalize entities. Map the three spellings of "Acme Steel" to one. Standardize units, dates, part formats in the staging layer.
- Flag low-confidence data. When a field is ambiguous, the agent says so — "I found two records that might match, here are both" — instead of guessing.
- Scope to the clean subset first. Don't try to cover every record. Pick the data domain that's cleanest — say, active suppliers or current open orders — and build the first agent there. Expand as you clean.
Messy data isn't a reason not to start. It's a reason to start narrow.
Match the agent to the legacy constraint
Not every agent fits every legacy stack. Match the use case to what your access door supports.
- DB read access, decent data → order-status exception lookup, supplier-doc intelligence, demand and inventory Q&A. High-frequency lookups that replace screen-chasing.
- File/EDI exports only → ops-review prep, exception reporting, anything batch-tolerant that runs off the data you already export.
- Screen scraping only → keep the scope tiny. One narrow lookup, read-only, with a human verifying. RPA on a legacy UI is fragile; don't build something critical on it.
Why this beats waiting for modernization
The modernization-first path has a hidden cost: you spend two years and a lot of money before anyone in operations sees a single benefit. By then the AI landscape has moved, the budget is exhausted, and the agents are still hypothetical.
Wrapping legacy systems flips the order. You ship a working agent in 30 days against the stack you have, put a number on the board — hours saved, errors caught — and fund the next one with proof instead of a promise. When modernization does happen, your agents are already built against a staging layer that doesn't care what's underneath. You re-point one pipe.
The legacy system isn't the blocker everyone treats it as. It's just a constraint to design around, and the design is well-understood.
Want to know which agent your specific legacy stack can support today? Grab the free First 5 Agents teardown — send me your ERP and how it exposes data, and I'll map the five agents you can ship without modernizing anything. Then book a 20-minute call and we'll figure out your access door and your cleanest data domain to start with.
Let's see what's worth building first.
A 15-minute call: tell me where your AI or planning is stuck, and I'll tell you the one thing worth building first — and whether it's worth doing at all.