What Is an AI Engineer? The Role, Explained by a Hirer

What is an AI engineer? Someone who builds production AI features on foundation models. Here is the role, what they do, and when you need one.

An AI engineer is the person who turns a foundation model into a feature your customers can actually use. They do not train models from scratch and they do not write research papers. They take models that already exist, wrap them in retrieval, tool calls, evals, and product logic, and ship something that holds up when a real user hits it at 2am. That is the whole job in one sentence, and almost everything written about the role buries it under a list of frameworks.

I am writing this from the hiring side of the table. I started as an engineer and now run conversion and revenue, which means I sit in two seats at once: I read the traces, and I read the P&L. At Devlyn I have hired more than 80 senior AI engineers and shipped over 200 products on top of them. So when someone asks me what an AI engineer is, I am not answering from a course syllabus. I am answering from the cost of getting the definition wrong, because every company that misunderstands this role hires the wrong person for it.

So if you came here asking what is an AI engineer, the short answer is the one above, but the useful answer is about the boundaries of the AI engineer role. The title is new, the field moves weekly, and three different jobs all have "AI" or "ML" in them. If you hire an AI engineer expecting a researcher, or a data scientist expecting an AI engineer, you pay a senior salary for a skill set that does not match the work.

This piece is the definition I wish more hiring managers had before they opened a req. It answers what an AI engineer is, what they do, how the role differs from the adjacent ones, and when your company actually needs to hire for it.

An AI engineer builds, they do not invent. Their output is a working production feature on top of an existing model, not a new model or a published result.
The role is defined by what ships, not by tools. The model names, the vector database, the orchestration library are all learnable. Production judgment is the scarce part.
It is a distinct role from ML engineer and data scientist. Different inputs, different outputs, different failure modes. Confusing them is the most expensive hiring mistake in this category.
The market is real and priced accordingly. Median base pay for the role sits around $134K in the US, and the good ones cost more because they are rare.
Not every company needs one yet. You need an AI engineer when an AI feature is going in front of customers and the cost of it being wrong is real.

What an AI engineer actually is

The term is recent. Shawn Wang coined "AI Engineer" in a 2023 essay called The Rise of the AI Engineer, and his framing still holds up. His argument was that foundation models had pushed a huge amount of capability out of the research lab and into the hands of ordinary software engineers. Tasks that used to need a research team and five years now need API docs and a focused afternoon.

His sharpest line is the one most hiring managers miss: "One can be quite successful in this role without ever training anything." That is the load-bearing distinction. An AI engineer can have a long, successful career and never once train a model, tune a loss function, or touch a GPU cluster. They consume models the way a backend engineer consumes a database. The model is an input to their work, not the work itself.

The supply math explains why the AI engineer role exists at all. Wang estimated something like five thousand people on the planet who can genuinely train frontier models, against fifty million software engineers who could learn to build products on top of them. The bottleneck was never going to be researchers. It was always the engineers who could take what the researchers built and make it survive contact with a customer, and that second group is what we now call AI engineers.

So the honest definition is functional, not technical. An AI engineer is a software engineer whose specialty is shipping reliable features powered by foundation models. The specialty shows up in the things that break: hallucination, latency, cost per call, prompt fragility, tool-call failures, and the gap between a demo that works once and a feature that works ten thousand times a day. If you want the deeper skills breakdown, I wrote a full piece on the skills that actually matter, and you can go there once you have the shape of the role.

What an AI engineer actually does day to day

Ask ten people what an AI engineer does and you will get ten lists of tools. That is the wrong altitude. The day-to-day is better described by the problems they own, because the tools change every quarter and the problems do not.

An AI engineer spends their day deciding what the model should and should not be asked to do, then building the scaffolding around it so the answer is right often enough to ship. That means designing prompts and context, wiring up retrieval so the model has the right facts, adding tool calls so it can act instead of just talk, and building the evals that tell you whether any of it is working. Then they watch it in production and fix the failures the evals did not catch.

Here is a concrete, NDA-safe version. A team I worked with had a support assistant that looked great in the demo and fell apart in the wild. The AI engineer's first week was not spent swapping models; it was spent reading two hundred real transcripts and sorting the failures into buckets. The discovery was that 60% of the bad answers came from one retrieval gap: the model could not see the customer's order history, so it guessed.

The fix was plumbing, not intelligence. That is the job, and the model was never the problem.

A second example, also illustrative. Another build kept blowing its latency budget because every request hit the largest available model. The AI engineer routed the easy 80% of requests to a smaller, cheaper model and reserved the expensive one for the hard tail. Cost dropped, speed improved, and quality held, because most requests never needed the big model in the first place.

Knowing which requests are which is the kind of judgment you are paying for. A lot of that work lives in agentic workflows, where the model is not just answering but taking steps.

If you are early in scoping a hire and want this in a format you can hand to recruiting, our team put the role into a usable spec on the AI application engineer page, which describes exactly the production work I am talking about here.

A responsibility-to-reality table

The reason "AI engineer" sounds vague is that job descriptions list responsibilities at a level of abstraction that tells you nothing. Here is the same set of responsibilities translated into what they actually look like on a Tuesday.

Responsibility	What it looks like day to day
Model integration	Calling a foundation model via API, handling timeouts, retries, and rate limits like any other unreliable dependency
Retrieval and context	Building the pipeline that finds the right documents and feeds them to the model so it answers from facts, not guesses
Prompt and output design	Shaping inputs and enforcing structured outputs so downstream code can trust what comes back
Evaluation	Building a frozen test set, scoring each change against it, and refusing to ship on vibes
Cost and latency control	Routing requests to the cheapest model that clears the bar; watching cost per resolved task, not cost per call
Production observability	Reading traces, catching the failures evals missed, and closing the loop with a fix

Notice that none of these rows say "train a model." That row belongs to a different job, which is the source of most of the confusion, so let me draw the line clearly.

The skills that actually matter

The skills that get listed on job posts are the easy ones to name and the least predictive of success. Yes, an AI engineer should know the common orchestration libraries, vector databases, and model APIs. But those are table stakes, and they are learnable by any competent engineer in a few weeks. They are not what separates a good hire from an expensive mistake.

The scarce skill is production judgment: the ability to look at a model output and know whether it is correct, why it is wrong when it is wrong, and what to change before it touches a customer. I have watched two engineers with identical resumes diverge completely on this. One ships a feature, sees it pass the demo, and moves on. The other ships the same feature, distrusts the demo, builds an eval set, and finds the 8% failure rate that would have cost the company a week of firefighting.

Same tools, different judgment. Only one of them is worth a senior salary.

The model names, the vector database, the orchestration library are all learnable in weeks. Production judgment is the part you cannot teach in an onboarding doc, and it is the part you are actually paying for.

This is why evaluation skill is the truest signal in the whole role. An engineer who treats LLM evaluation as a first-class part of the work, not an afterthought, is an engineer who understands that the model will be confidently wrong sometimes and that catching it is their job, not the user's. When I interview for this role, how a candidate talks about measuring quality tells me more than anything on their resume.

AI engineer vs ML engineer vs data scientist, at a glance

These three roles get used interchangeably, and the cost of confusing them is real. They take different inputs, produce different outputs, and fail in different ways. Here is the distinction in one view.

Role	Primary output	Works mostly with	Hire them when
AI engineer	A production feature built on an existing model	Foundation model APIs, retrieval, evals, product code	You are shipping an AI feature to users
ML engineer	A trained or fine-tuned model and its pipeline	Datasets, training loops, GPUs, model architecture	You need a custom model the market does not sell
Data scientist	An analysis, insight, or prediction	Statistics, notebooks, dashboards, experiments	You need to understand your data and answer questions

The shortest way to remember it: a data scientist tells you what the data says, an ML engineer builds a model when you cannot buy one, and an AI engineer ships the feature on top of a model someone already built. Most companies in 2026 need the third role far more often than the first two, because foundation models removed the need to train your own for the majority of product use cases. If you are still deciding which one you actually need, our definitive guide to hiring AI engineers walks through the call.

When your company actually needs an AI engineer

Here is the part the career listicles skip, because it only matters if you are the one writing the check. Not every company needs a dedicated AI engineer yet, and hiring one too early is its own kind of waste.

You need an AI engineer when three things are true at once. First, you are putting an AI-powered feature in front of real users, not running an internal experiment. Second, the cost of that feature being wrong is real, whether that is a bad customer experience, a compliance risk, or money. Third, the feature is core enough that you cannot afford to have it half-work indefinitely.

When all three hold, the production judgment of an AI engineer is the thing standing between you and a feature that embarrasses you in front of a customer.

You probably do not need one yet if your AI usage is internal tooling, occasional, and low-stakes, the kind of thing a strong generalist engineer can bolt on with an API key. The mistake I see most often runs the other way, though: a company ships an AI feature with a generalist, it works in the demo, it fails quietly in production, and nobody on the team has the eval discipline to notice until support tickets pile up. By then the fix is more expensive than the right hire would have been. The timing question deserves its own treatment, which is why I wrote about when to hire an AI engineer separately.

There is also a cost reality to be honest about. The median base for this role sits around $134,000 in the US per Coursera's salary data citing Glassdoor, and the genuinely good ones command more because the supply is thin. That is not a number to flinch at if the feature matters; it is a number to plan around. I broke down the full picture, including the senior premium, in what an AI engineer actually costs.

How to hire one without getting burned

Once you know you need the role, hiring well comes down to testing for the scarce thing, not the learnable thing. Most hiring processes for this role test framework recall, which is exactly backward. A candidate who can name every orchestration library and cannot tell you how they would catch a hallucination in production is the wrong hire wearing the right keywords.

Test for judgment instead. Show a candidate a real model output that is subtly wrong and watch whether they catch it and explain why. Ask how they would build the eval set for a feature, and listen for whether they reach for real production traffic or for a number that makes them feel good. The answers separate the operators from the resume-keyword matchers fast.

I put the full interview approach, including the questions that actually predict success, into a job description and hiring framework you can adapt.

If you would rather not run that gauntlet yourself, that is precisely the gap our team fills. Hiring senior AI application engineers who have already proven this judgment in production is what Devlyn does, so you skip the part where you learn the failure modes by hiring wrong first. And if you are building the wider team around the role, the playbook is in my book Building an AI-Native Team.

Frequently asked questions

What is an AI engineer in simple terms? An AI engineer is a software engineer who builds production features on top of existing foundation models. They do not train models from scratch. They take models that already exist and wrap them in retrieval, tool calls, evals, and product logic so the result is reliable enough to put in front of real users.

What does an AI engineer do day to day? They design what the model is asked to do, build the retrieval and tool-calling scaffolding around it, write evals that measure whether it works, and then watch it in production and fix the failures. The work is mostly engineering plumbing and judgment, not model math. Reading real traces and catching the cases the demo hid is a large part of the job.

What is the difference between an AI engineer and an ML engineer? An ML engineer trains and fine-tunes models, working with datasets, training loops, and GPUs. An AI engineer ships products on top of models that already exist, working with APIs, retrieval, and evals. You hire an ML engineer when you need a custom model the market does not sell, and an AI engineer when you are putting an AI feature in front of users.

Do you need a PhD to be an AI engineer? No. The role was defined specifically by the fact that you can succeed in it without ever training a model or holding a research degree. What matters is software engineering ability plus production judgment about where AI features break. The scarce skill is knowing whether an output is right and why it is wrong, not the academic credential.

If you want the full hiring playbook, start with the definitive guide to hiring AI engineers. And if you would rather hire a senior AI engineer who has already shipped production features and proven the judgment this article describes, that is exactly what the Devlyn team does. Define the role correctly, then hire for the part you cannot teach.