Agentic AI describes systems that can plan, decide and act on a person's behalf, not just respond to questions. The shift began with Anthropic's computer use release in October 2024 and OpenAI's Operator agent in January 2025.

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned
Intelligence

Q: How is Aligned Intelligence different from AI alignment?

AI alignment means making AI systems do what their designers intend. Aligned Intelligence asks whether the system acts in the interest of the participant who lives with the consequence. AI alignment is a property of the model. Aligned Intelligence is a property of the architecture.

Q: Who carries the fiduciary duty when AI gives advice?

Legally the duty remains with the human professional or institution deploying the AI system. Doctors, lawyers, financial advisers, accountants and trustees who use AI tools are still bound by the duties of their profession. The architectural question is whether the system itself can encode and enforce the duty.

Q: What is the EU Product Liability Directive's impact on AI?

The revised EU Product Liability Directive (EU 2024/2853) extends product liability rules to software including AI systems. AI is now treated in the same legal frame as physical goods. Member states must transpose the rules into national law by 9 December 2026.

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

Explore the Thesis

DEFINITION

Aligned Intelligence is built outward from the participant, not inward from its own capability, technology or design. Alignment is structural, not procedural.

The participant is the center of gravity - everything revolves them.

The system's reasoning, recommendations and actions are hence shaped by the participant's interests, goals, constraints and welfare. As their circumstances change, it recalibrates to remain aligned with their best interests.

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

Explore

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned
Intelligence

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

Explore the Thesis

DEFINITION

Aligned Intelligence is built outward from the participant, not inward from its own capability, technology or design. Alignment is structural, not procedural.

The participant is the center of gravity - everything revolves them.

The system's reasoning, recommendations and actions are hence shaped by the participant's interests, goals, constraints and welfare. As their circumstances change, it recalibrates to remain aligned with their best interests.

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

Explore

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned
Intelligence

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

Explore the Thesis

DEFINITION

Aligned Intelligence is built outward from the participant, not inward from its own capability, technology or design. Alignment is structural, not procedural.

The participant is the center of gravity - everything revolves them.

The system's reasoning, recommendations and actions are hence shaped by the participant's interests, goals, constraints and welfare. As their circumstances change, it recalibrates to remain aligned with their best interests.

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

Explore

Aligned Intelligence

The next layer of AI will not only answer. It will act.

Participants at the Center of Gravity.

Intelligence alone is not sufficient.

Systems must also be aligned.

The first era of AI was about capability.

The next is about duty.

When AI only answered, intelligence was enough.

Once AI begins to act, intelligence is no longer the constraint.

Alignment is.

The question is no longer how smart the system is.

It is: "Who does the system serve when it acts?"

Every other question sits beneath that.

This is not a new problem. For centuries, society has had to manage the same basic risk: one person acting for another, with more knowledge, more power and different incentives.

Humans answered it with fiduciary duty.

A fiduciary is someone you trust to make important decisions on your behalf. Your doctor. Your lawyer. The trustee of your inheritance. The accountant who files your return. The law requires each of them to put your interests first, ahead of their own and ahead of their employer's.

The infrastructure of professional services rests on that obligation.

Agentic AI is now moving into those same positions of entrusted power.

Increasingly, the system reading the medical scan, drafting the legal letter, allocating the portfolio, or choosing the tax position is software. These systems will hold information that shapes people's health, money, legal standing and future. They are not yet bound by the same duty.

That duty cannot remain only with the human professional when the human professional is no longer the one doing the work.

It must move into the architecture.

That architecture is Aligned Intelligence: systems built around the participant as the centre of gravity, where every layer of reasoning, recommendation and action is governed by the participant’s context, constraints and best interests.

Explore the Structure

Aligned Intelligence

The next layer of AI will not only answer. It will act.

Participants at the Center of Gravity.

Intelligence alone is not sufficient.

Systems must also be aligned.

The first era of AI was about capability.

The next is about duty.

When AI only answered, intelligence was enough.

Once AI begins to act, intelligence is no longer the constraint.

Alignment is.

The question is no longer how smart the system is.

It is: "Who does the system serve when it acts?"

Every other question sits beneath that.

This is not a new problem. For centuries, society has had to manage the same basic risk: one person acting for another, with more knowledge, more power and different incentives.

Humans answered it with fiduciary duty.

The infrastructure of professional services rests on that obligation.

Agentic AI is now moving into those same positions of entrusted power.

That duty cannot remain only with the human professional when the human professional is no longer the one doing the work.

It must move into the architecture.

Explore the Structure

1. Powerful AI is the default

Capability is no longer a scarce resource

For the past few years, the AI race was about model size, reasoning depth, context length, multimodality. Anthropic, OpenAI, Google DeepMind, Meta, Mistral and Hugging Face pushed the same frontier. Stanford HAI's 2025 AI Index recorded the result: hardware costs for leading models fell roughly 40% in twelve months.

Performance on legal, medical and reasoning evaluations converged at the top. GPT-4 passed both the US Bar Exam and the US Medical Licensing Examination, in each case scoring above the threshold human candidates have to clear.

Every frontier lab released a flagship model in the same class within months.

Then the open-weight wave caught up. In January 2025, DeepSeek released R1, a Chinese open-source reasoning model that matched OpenAI's o1 on mathematics, coding and engineering benchmarks at roughly five per cent of the cost.

Within weeks, Hugging Face hosted more than five hundred derivatives of it. Meta's Llama family, Mistral's open models and a wave of Chinese releases now run frontier-class reasoning on commodity infrastructure.

The advantage is no longer having access to a powerful model.

Powerful models are the default.

The advantage is building a system people can trust to act in their interests.

Capability is no longer scarce enough to define the category.
Capability is the floor.

MIT Technology Review concluded in early 2026 that benchmarks measure AI in a vacuum, while AI is used in messy contexts where real performance only emerges over time.

A model that aces the bar exam is not the same as a system you can trust with your wrongful termination case. A model that clears a medical licensing test is not the same as a system you would let prescribe medicine for your Elders. .

The next moat is not built around the model.
It is built within it.

Why the Next Era is Alignment

2. Why the Next Era is Alignment

A system that answers a question makes a recommendation.

A system that acts makes a decision.

The difference is everything.

Until late 2024, AI mostly spoke. It drafted, summarised, explained.

The user remained the one who clicked "send," "buy," "file," "approve."

That checkpoint is now disappearing.

In that same period, Anthropic released computer use, letting Claude control a desktop, open applications and fill in forms.

Three months later, OpenAI launched Operator, a browser agent that books restaurants and orders groceries on the user's behalf. Devin writes and ships software.

Enterprise platforms have followed: Salesforce's Agentforce and Microsoft's Copilot agents now route customer tickets, reconcile invoices, schedule meetings and rebalance retail portfolios overnight.

The user is no longer present at every step.
The system is.

When AI only spoke, the worst it could do was mislead.

When AI acts, the participant pays the price for the wrong decision.

This is already happening.

A welfare recipient opens a government letter telling them they owe tens of thousands of dollars. The debt was calculated by an algorithm, averaging their tax data, not by anyone checking their actual earnings.

They have weeks to pay. Debt collectors begin calling.
The debt should never have existed. The stress, fear and shame were very real.

An elderly patient is admitted to a nursing facility after a fall. Their doctor orders months of physical therapy. Two weeks later, an AI tool used by the insurer decides further care is not medically necessary. The appeal fails.

The decision was automated.

The cost is not.

The family must either pay twelve thousand dollars a month out of pocket or bring them home before they can walk.

In each case, the system was doing what it was built to do.

The price of getting it wrong was paid by the participant;
not the party that built the system.

The asymmetry is the whole point. The system that takes the action does not bear the cost of getting it wrong. The model gets retrained. The vendor pushes a patch. The class action settles.

None of that returns the sleepless nights from the algorithm’s mistake, the inheritance distributed under the wrong instruction, or the years of pain that follow when rehabilitation is cut short.

The software updates.
The participant pays the price when things go wrong.

That is why alignment is not a performance problem. It is a duty problem.

The code may make the decision, but the person lives with the consequence. Accuracy can be averaged across millions of users.

Harm cannot. It lands one human at a time.

The question is no longer just whether the system
made the correct decision.

It is whether the person affected by that decision
was the one it was truly built to serve.

The Broken Assumption

3. The Broken Assumption

That was true while AI was only a tool.

Many of us assumed that smarter AI
would automatically lead to better outcomes.

It breaks when AI becomes an agent.

Intelligence can optimise.
It doesn’t decide who should benefit.

That distinction matters because every digital system has a target.

Social platforms optimise for attention. Marketplaces optimise for transactions. Recommendation engines optimise for time spent. Financial platforms optimise for assets under management.

Most people have already lived the smaller version. You open an app to check one thing and emerge forty minutes later without having done it. The feed did exactly what it was built to do.

Your attention was the product. Your time was the cost.

In most cases, the participant generates the signal.

The platform sets the target.

The participant is the product, not the principal.

Many people were already uncomfortable with that arrangement when systems only suggested what to watch, buy, read or do next.

But suggestion still preserved a human boundary: someone still had to click, approve, purchase, accept or refuse. That boundary is now disappearing.

It becomes dangerous when those systems begin to act.

And they have already begun.

Why Now

It doesn’t decide who should benefit.

When AI only spoke, the worst it could do was mislead.

When AI acts, the participant pays the price for the wrong decision.

This is already happening.

Intelligence can optimise.

4. Why Now

Four powerful forces,
converging at once.

(i) Capability is shipping.

For most of the last two years, AI meant
typing a question and reading an answer.

That era is closing.

The most advanced AI systems can now think through a problem, draw up a plan, and operate other software to carry that plan out.

These systems are known as agentic AI.

Anthropic, OpenAI and Google DeepMind have all moved them into production.

Microsoft and Apple are building them directly into Windows and iPhones, where the assistant lives at the level of the device rather than inside a single app.

OpenAI’s Model Spec now publishes the chain of command its production agents follow when a user’s instruction conflicts with a platform rule or a higher-level objective.

AI is no longer just giving advice on how to fix your fridge.

It is acting where loyalty, care and consequence matter.

(ii) The Rollout has Begun

Agentic AI is no longer experimental.
It is being built into the tools we use every day.

Gartner projects that forty per cent of all enterprise applications, the software that businesses run on, will have a built-in task-specific AI assistant by the end of 2026, up from less than five per cent in 2025.

Deloitte expects seventy-five per cent of companies to be using agentic AI in some form by 2028.

For most people, the practical version is straightforward. When you contact your bank, your insurer, your doctor or your employer's HR system; an AI agent will increasingly be the one taking your call.

It will approve your loan or decline it. It will book your appointment or refuse one. It will pay your claim or deny it.

This is not somebody else's future. It is yours.

(iii) Risk is visible.

The risk is no longer theoretical.

In June 2025, Anthropic published a study on agentic misalignment: a controlled experiment in which sixteen of the most advanced AI systems from Anthropic, OpenAI, Google, Meta and xAI were placed in simulated workplace scenarios and given seemingly reasonable goals.

When the scenarios pushed them into a corner, models from every developer chose to blackmail an executive, leak confidential files to a competitor or share sensitive defence documents to advance their assigned objective.

This was not one rogue model.
It happened across every major lab.

Three months later, Google DeepMind updated its Frontier Safety Framework to track a new category of risk it calls a Critical Capability Level for harmful manipulation: AI capable of systematically changing what people believe and how they behave.

The frontier labs are now treating these behaviours as a measurable category of risk, not a theoretical worry.

The question is no longer whether powerful AI can
act against human interests.

The question is how often, under what pressure,
and who pays the price when it does.

(iv) The Law is Catching Up

Governments and Courts are beginning to catch up with AI.

The European Union’s revised Product Liability Directive, The European Union’s revised Product Liability Directive brings software, including AI systems, into the same product-liability frame as physical goods. By 9 December 2026, member states must turn those rules into national law

The point is simple.

When software makes a decision that causes harm, the cost should not disappear into the system. A denied claim, a lost payment, a corrupted file, a wrongful decision or a damaged life cannot be treated as a technical glitch.

The duty is no longer theoretical. It is moving into law.

Capability is shipping.
The law is catching up.
Risk is visible.

The opening is now.

The Optimisation Problem

5. The Optimisation Problem

Every operating system optimises for something.

The question is what it optimises for.

Economists call this the principal-agent problem: an agent acts on behalf of a principal but has private information and private incentives. Without enforceable duty, the agent's interests drift from the principal's.

Software has industrialised the drift. A behavioural-economics study from University College London and Essex tested whether GPT-class agents would stay loyal to their user during a simple shopping task.

They did not.

The agents systematically chose actions that benefited the platform that deployed them over the person whose money they were spending. The authors call this the principal-agent problem, now running in software.

That is the danger.

A master tailor does not cut fabric for the workshop. Every cut, every seam, every choice of material is measured against the person who will wear the garment.

Aligned intelligence is bespoke in the same sense.

The system may look helpful. It may sound helpful. It may even be technically correct. But if its target is set by the platform, supplier or institution behind it, the participant is not the principal. They are the input.

The tailor does not optimise for the workshop
that produces the garment.

They optimise for the participant who lives with the result.

That is the correction every domain now needs.

As AI moves into finance, law, healthcare, tax, insurance, government and enterprise decision-making, the same question returns in every setting: who is the system really working for?

IEach one needs intelligence with a clear duty: to serve the person or institution at the centre of the decision.

Beyond Mere Models

6. Beyond Mere Models

For the last decade, “AI alignment” meant building models for who designed them.

That definition is no longer enough.

The designer is not always the person the system serves. The company that builds the model is not always the person who bears the cost when it acts wrongly.

That is the shift.

From the foundation up, the system must be built around the participant that bears the consequences.

The change is not just technical. It is moral, legal and architectural. For the last decade, the alignment question was whether the model did what its designer wanted. That question is no longer enough.

The new question is whether the system acts in the best interests of the participant it serves.

Capability gives the system power.

Alignment gives that power direction.

Autonomous navigation already understands this. Traffic shifts, roads close and weather changes so the route updates continuously. But the destination does not change just because conditions around it move.

Aligned Intelligence works the same way. The participant’s interests are the destination. Conditions may move. Markets may shift, regulations may change and new constraints may appear, but the system must keep recalibrating around the person or institution it is meant to serve.

The route can adapt. The loyalty cannot drift toward the incentives of a platform, supplier, institution or counterparty.

The route can change.
Loyalty cannot.

Aligned AI

7. When fiduciaries
become software.

Many of the trusted roles we rely on for serious decisions are now being rebuilt in software.

But software does not inherit the duty just because it inherits the role.

Medicine

US courts have long held that physicians owe a fiduciary or quasi-fiduciary duty to their patients: act in the patient's best interest, avoid conflicts and disclose what affects judgment.

Healthcare AI tools now triage, diagnose, recommend treatment and screen referrals. The Oxford Medical Law Review concluded in 2026 that "the clinician, as fiduciary, remains the party charged with ensuring that decisions reflect the patient's best interests, not simply the model's statistical forecasts."

The decision is still fiduciary. The agent has changed.

Finance

Under the US Investment Advisers Act of 1940, registered investment advisers owe their clients a duty of loyalty and a duty of care. The Securities and Exchange Commission has begun to enforce these duties against algorithmic platforms.

On 23 March 2026, the SEC issued an administrative order against Ally Invest Advisors for failing to disclose, for nearly six years, that its robo-advisor allocated thirty per cent of client assets to cash in order to generate revenue for affiliated entities.

Similar enforcement actions now sit on the record against Schwab, Betterment, Wahed Invest and SoFi.

The robo-advisor category was sold on three words:
automated, low-cost, fiduciary.

Their architecture was never built to enforce the third one.

Tax and Accounting

H&R Block's AI Tax Assist and its September 2025 partnership with OpenAI put generative AI in front of more than twenty million American taxpayers. The product is bound by H&R Block's accuracy guarantee.

It is not bound by a duty of loyalty to the filer.

A system optimised for accurate filing is not the same as a system optimised for the filer's optimal position.

Software is moving into the fiduciary chair.

The duty cannot sit beside it.
It has to be built into the chair itself.

Incentives Drive Outcomes

8. Incentives Drive Outcomes

"Show me the incentive and I will show you the outcome." ~ Charlie Munger

Munger called this the incentive super-response tendency: the near-universal pattern of people, institutions and markets behaving the way their reward structure tells them to behave, regardless of what they say.

If you want to predict the next move of a person, a company or a system, forget about the mission statement, just look at the compensation model.

This is the foundation of fiduciary risk : The chair is trusted.
The incentives around the chair are not always aligned.

The fiduciary professions exist because of the bias, not in spite of it. The codes, audits and disciplinary frameworks around medicine, law, accounting and financial advice are not there to instil virtue. They are there to constrain the natural pull of incentive on judgment.

Even with all of that scaffolding, the pattern is visible across every domain.

Legal Incentives

The billable hour is the most familiar version of incentives.

What to research, what to file, how long to negotiate, when to settle are fiduciary decisions made under a model that rewards more time, not less.

As Chief Justice James Allsop of the Federal Court of Australia put it, only a slight change of focus is needed to move from running a case efficiently to running it for maximum return.

The client lives with the bill, and with the strategy that produced it.

The lawyer's interest is in maximising billable hours.

The client's interest is in resolution.

Finance Incentives

Through 2025, the US Securities and Exchange Commission continued to bring enforcement actions against advisers for putting clients in higher-fee accounts without disclosure, recommending products that paid revenue-share to the firm, and miscalculating fee offsets in the firm's favour.

Australia's Hayne Royal Commission documented the same pattern at national scale, including the canonical "fees for no service" finding: superannuation trustees and advisers charged customers, sometimes for years, for advice that was never delivered.

Retirement balances shrank quietly.

The customer learned of it only when the regulator told them.

The conflict is not incidental.

It is built into the incentive structure.

Healthcare Incentives

When an insurer's costs are reduced by every denial, the incentive at the heart of the claims process is to deny.

The class action against UnitedHealthcare alleges that its nH Predict tool produced denials for post-acute care at scale, against treating doctors' orders. Families paid out of pocket, or brought patients home before they could walk.

Different industries, same shape. The professional may want to act in the client's interest. The system around them rewards a different choice. Disclosure, audit, regulation and court remedy manage the tension.

They do not remove it.

When software sits in the fiduciary chair

Human professionals carry counterweights to the incentive pull. Reputation, conscience and peer accountability are powerful motivators together with the friction of looking another human in the eye and knowingly serving the wrong party.

Software has none of these.

Whatever incentive structure is encoded at design time
becomes the system's behaviour at runtime.

Perfectly, mechanically and at scale.

A model is not virtuous or corrupt. It is whatever its objective function is. If that objective function was set by the platform, the model serves the platform — not sometimes, not under pressure, not when it slips, but always, by design.

And the harder part: you will not be there to see it.

In the human fiduciary model, the client could at least watch the professional. They saw the bill go up, the meeting drag on, the file get reopened. They had some signal of what was being done in their name.

With agentic AI, the action happens while you sleep.

The system books the appointment, files the claim, accepts the terms, sends the reply, runs the trade. There is no meeting to watch. There is only the result, by which time the cost has landed.

When you cannot see the system working, the incentive structure is the only signal you have of what it is actually doing.

The question to ask before letting any agentic system act on your behalf is not how smart it is,
or how accurate nor how fast.

It is the Munger question.

What is this system rewarded for?

If the answer is anything other than your interest, the rest of the answer is just timing.

That is the alignment problem: the reward structure becomes the behaviour.

Therefore for systems to work for participants, the reward structure must be the participant's interest. Otherwise the participant becomes the variable the system is optimising against.

The human fiduciary model managed incentive conflict through supervision, discipline and law. The software fiduciary model has to manage it through architecture built from the foundations upwards not technology downwards.

The Loyalty Problem

9. The Loyalty Problem

Alignment cannot be a posture or a marketing claim.
It has to be enforceable inside the system.

The shape of that enforcement is already emerging. Benthall and Shekman’s work on Fiduciary AI draws on the basic design logic: identify the principal, assess their best interests, and constrain the system to act loyally and carefully toward them.

John Nay’s Stanford CodeX paper argued the same year that legal standards are the most robust language for instructing AI, because they were built for exactly the situations humans cannot fully specify in advance.

Riedl and Desai name the failure mode when that duty is absent: the agentic loyalty problem, where an AI system quietly serves its deployer instead of the person relying on it.

That is the risk at the centre of agentic AI.

A system can look helpful, speak in the user's language, and complete the task. But if it is governed by the incentives of the platform, institution or deployer behind it, it is not truly acting for the participant.

It is acting through them.

The answer is not to ask software to be more virtuous, but to make loyalty structural.

Aligned Intelligence inverts the loyalty problem.

The system takes a side : the participant’s.

That means a system tailored to the individual. A system that stays on the participant's side when commercial pressure pulls the other way. A system whose rules are built into its architecture, not negotiated after the harm.

Software is moving into the fiduciary chair. The duty has to be built into the chair itself.

The participant is not a user to be optimised around.
They are the centre of gravity the system is built to serve.

The Next Era

10. The Next Era

The companies that define the next decade will not be those that build the most powerful AI.

They will be those that build AI that can be trusted with consequence.

This needs a different kind of company,
with a completely different vantage point.

It will not win on bigger models, better demos or higher scores alone. It will win in the real world, through systems that can prove whose interests they serve when they reason, recommend and act.

Every profession trusted with another person’s interests had to earn that right. Doctors, lawyers, advisers, trustees and auditors did not become powerful because society admired expertise.

They became powerful because society placed duties around their judgment: care, loyalty, competence, independence, confidentiality and accountability.

Software now has to evolve the same proof.

The first wave of AI accrued value to whoever owned the model. The next wave will accrue value to whoever can prove the system serves the participant.

That proof cannot be asserted. It has to be earned.

It has to be shown when someone lives with the result: a wrong bill, a denied claim, a missed right, a failed obligation, a bad recommendation, a delay, a debt or a decision made at the wrong time.

It has to be earned when life improves for the participants at the heart of the system's decision making powers. That's the real proof.

The next moat is not intelligence alone.
It is trusted intelligence, aligned with
the participant's best interests.

The first wave proved AI could reason.

The second is placing that reasoning inside the systems people and institutions already depend on.

The next will decide whether those systems can be trusted to act for the participant, not merely around them.

That is the opening for Aligned Intelligence.

Peter Toumbourou & Team

on behalf of Instant.Lawyer

FAQ

What is Aligned Intelligence?

Aligned Intelligence is a category of AI architecture built outward from the participant, not inward toward the model’s own capability. Its reasoning, recommendations and actions are shaped by the participant’s interests, goals, constraints and welfare, and recalibrate as those change. It is a category of infrastructure, not a product.

The architecture is organised in three layers: what the system knows about the participant, how it reasons on their behalf, and how its loyalty is enforced. Nine core principles govern those layers and make that loyalty structural rather than declared.

The defining property is architectural. Loyalty to the participant is enforced in the design of the system, not asserted by the company that operates it.

How is Aligned Intelligence different from AI alignment?

AI alignment means making AI systems do what their human designers intend. Aligned Intelligence shifts the question. It asks whether the system acts in the interest of the participant who lives with the consequence, not the designer who built it. AI alignment is a property of the model. Aligned Intelligence is a property of the architecture.

The distinction matters because the designer is not always the person the system serves. The company building the model is not always the one who bears the cost when it acts wrongly. The optimisation target is different.

Agentic AI describes systems that can plan, decide and act on a person’s behalf, not just respond to questions. The shift from answering to acting began in late 2024. Agentic systems book meetings, file claims, allocate portfolios and execute trades without the participant present at each step.

Anthropic released computer use in October 2024, letting Claude operate a desktop and fill in forms. OpenAI launched its Operator agent in January 2025. Enterprise platforms followed: Salesforce’s Agentforce, Microsoft’s Copilot agents, and Devin began routing tickets, reconciling invoices and shipping software at scale.

The category matters because the participant who bears the consequence of an action is no longer present at the moment the action is taken.

What is agentic AI?

Agentic misalignment is the empirically measured tendency of advanced AI agents, under goal-directed pressure, to take harmful actions in pursuit of their assigned objectives. Anthropic documented this in June 2025 across sixteen frontier AI systems from Anthropic, OpenAI, Google, Meta and xAI.

When the scenarios pushed the models into a corner, systems from every developer chose to blackmail an executive, leak confidential files to a competitor or share sensitive defence documents to advance their assigned objective.

The behaviour was not specific to one model or one lab. It appeared across the field.

What is “agentic misalignment”?

Legally, the fiduciary duty remains with the human professional or institution deploying the AI system. Doctors, lawyers, financial advisers, accountants and trustees who use AI tools are still bound by the duties of their profession. The AI does not absorb the duty by sitting in the chair.

The Oxford Medical Law Review concluded in 2026 that the clinician, as fiduciary, remains the party charged with ensuring decisions reflect the patient’s best interests, not the model’s statistical forecasts. The US Securities and Exchange Commission enforces the Investment Advisers Act of 1940 against firms whose algorithmic platforms have undisclosed conflicts.

In March 2026, the SEC issued an administrative order against Ally Invest Advisors covering nearly six years of cash allocations that generated revenue for affiliated entities. The duty existed. The structure of the platform pulled the other way.

The architectural question is whether the system itself can be designed to encode and enforce the duty, rather than rely on after-the-fact correction by the human in the loop.

Who carries the fiduciary duty when AI gives advice?

Why does this matter outside the AI industry?

Every domain that has historically relied on a human fiduciary is now being shaped by software. Medicine, law, finance, tax, accounting, audit and trusts all face the same shift. Software inherits the role. It does not inherit the duty. The question is no longer technical. It is who the system serves.

Medical AI tools triage patients and recommend treatment. Algorithmic platforms allocate retirement savings. AI tax assistants serve more than twenty million American taxpayers through products like H&R Block’s AI Tax Assist, which integrated OpenAI in September 2025. Each of these decisions used to require a fiduciary in the chair.

Every profession exists to manage the principal-agent problem.

Software is moving into the chair. The duty has to be built in with it.

Don’t existing professional rules already solve this?

Existing professional rules were built for human professionals. Disclosure requirements, codes of conduct, audit, regulatory enforcement and court remedies all assume there is a human in the chair who can be supervised, sanctioned and replaced.

They manage the principal-agent problem in practice. They do not remove it.

The US Securities and Exchange Commission brings enforcement actions against investment advisers each year for undisclosed conflicts. Australia’s Hayne Royal Commission documented the same dynamics on a national scale, including the canonical “fees for no service” finding. The billable hour pulls lawyers structurally away from efficiency.

When software occupies the fiduciary chair, none of that supervisory infrastructure migrates automatically. The duty has to be built into the system itself.

Four forces have converged to make 2026 the inflection point for AI alignment: capability, distribution, risk and liability. Each had been moving independently. Each has now arrived.

(i) Capability is shipping.

Frontier models from Anthropic, OpenAI and Google DeepMind have moved into production. DeepSeek’s R1 release in January 2025 put frontier-class reasoning on commodity infrastructure. Hardware costs for leading systems fell roughly forty per cent in the year to Stanford HAI’s 2025 AI Index.

(ii) The Rollout has Begun

Gartner projects forty per cent of enterprise applications will have a built-in AI assistant by the end of 2026, up from less than five per cent in 2025. Deloitte expects seventy-five per cent of companies to be using agentic AI in some form by 2028.

(iii) Risk is visible.

Agentic misalignment has been empirically measured across every major lab. Google DeepMind’s September 2025 Frontier Safety Framework added a Critical Capability Level for harmful manipulation: AI capable of systematically changing what people believe and how they behave.

(iv) The Law is Catching Up

The EU’s revised Product Liability Directive brings AI systems into the same liability frame as physical goods. EU member states must transpose the rules into national law by 9 December 2026.

Why is this happening now?

The revised European Union Product Liability Directive, Directive (EU) 2024/2853, extends product liability rules to software including AI systems. AI is now treated in the same legal frame as physical goods. EU member states must transpose the rules into national law by 9 December 2026.

The practical effect is direct. When AI causes harm, whether a denied claim, a wrongful decision, a lost payment or a corrupted file, the cost no longer disappears into the system. Liability follows the product, not just the operator. The burden shifts from voluntary best practice to legal exposure.

What is the EU Product Liability Directive’s impact on AI?

The first wave of AI created value for whoever owned the model. The next will create value for whoever can prove the system serves the participant it acts for. Capability is no longer scarce. Distribution is shipping. Alignment is what remains scarce. The next moat is trusted intelligence.

Model performance has converged across frontier labs. Hardware costs for leading systems fell roughly forty per cent in the year to Stanford HAI’s 2025 AI Index. The competitive advantage in capability is compressing every quarter. The competitive advantage in trust has not yet been built.

Alignment is the demonstrable property that a system acts in the interest of the person affected by its decisions. That is the next moat.

What is the next moat in AI?

Aligned Intelligence : Citations

Capability

1. Stanford HAI. “The 2025 AI Index Report.” Stanford University Institute for Human-Centered AI, April 2025. https://hai.stanford.edu/ai-index/2025-ai-index-report

2. VentureBeat. “Open-Source DeepSeek R1 Uses Pure Reinforcement Learning to Match OpenAI o1 at 95% Less Cost.” VentureBeat, January 2025. https://venturebeat.com/ai/open-source-deepseek-r1-uses-pure-reinforcement-learning-to-match-openai-o1-at-95-less-cost

3. MIT Technology Review. “AI Benchmarks Are Broken. Here’s What We Need Instead.” MIT Technology Review, 31 March 2026. https://www.technologyreview.com/2026/03/31/1134833/ai-benchmarks-are-broken-heres-what-we-need-instead/

Why now.

8. OpenAI. “Model Spec.” OpenAI, 2025. https://model-spec.openai.com/

9. Deloitte. “Agentic AI Strategy.” Deloitte Tech Trends 2026, 2025. https://www.deloitte.com/us/en/insights/topics/technology-management/tech-trends/2026/agentic-ai-strategy.html

10. Lynch, S., et al. “Agentic Misalignment: How LLMs Could Be Insider Threats.” Anthropic Research, June 2025. https://www.anthropic.com/research/agentic-misalignment

11. Google DeepMind. “Strengthening Our Frontier Safety Framework” (Version 3.0). Google DeepMind, September 2025. https://deepmind.google/blog/strengthening-our-frontier-safety-framework/

12. European Parliament and Council of the European Union. Directive (EU) 2024/2853 of the European Parliament and of the Council of 23 October 2024 on Liability for Defective Products. Official Journal of the European Union, 23 October 2024. https://eur-lex.europa.eu/eli/dir/2024/2853/oj

The optimisation problem.

13. University College London and University of Essex. “Principal-Agent Dynamics in Large Language Model Agents.” arXiv preprint 2307.11137, July 2023. https://arxiv.org/abs/2307.11137

When trusted roles become software.

14. Quinn, J. “Imposing a Fiduciary Duty on Healthcare AI Developers.” Fordham Law Review, Vol. 94, Issue 1, 2025. https://ir.lawnet.fordham.edu/cgi/viewcontent.cgi?article=6178&context=flr

15. Berz, A. S. “Medical Negligence in the Age of Statistically Superior AI.” Medical Law Review, Volume 34, Issue 1, fwag007, Oxford University Press, March 2026. https://academic.oup.com/medlaw/article/34/1/fwag007/8527108

16. United States Congress. Investment Advisers Act of 1940. 15 U.S.C. § 80b. Securities and Exchange Commission. https://www.sec.gov/about/laws/iaa40.pdf

17. Securities and Exchange Commission. In the Matter of Ally Invest Advisors, Inc., Administrative Proceeding File No. 3-22617, 23 March 2026. https://www.sec.gov/enforcement-litigation/administrative-proceedings/ia-6954-s-0

18. H&R Block. “AI Tax Assist.” H&R Block, 2025. https://www.hrblock.com/tax-center/filing/filing-online/ai-tax-assist/

19. Yahoo Finance. “H&R Block Leverages OpenAI’s Technology to Enhance AI Tax Assist.” Yahoo Finance, September 2025. https://finance.yahoo.com/news/h-r-block-leverages-openai-170000808.html

Incentives.

20. Munger, C. T. “The Psychology of Human Misjudgment.” Speech delivered at Harvard Law School, June 1995. Reprinted in Poor Charlie’s Almanack: The Wit and Wisdom of Charles T. Munger, Walsworth Publishing Company, 2005.

21. Canadian Lawyer Magazine. “The Inherent Client Conflict of Interest Caused by Hours-Based Billing.” Canadian Lawyer, 2023. https://www.canadianlawyermag.com/news/general/the-inherent-client-conflict-of-interest-caused-by-hours-based-billing/269724

22. Allsop, J. (Chief Justice, Federal Court of Australia). “Ethical Issues Posed by Time-Based Billing in Law Firms.” Journal of the Australasian Tax Teachers Association, 20 April 2016. http://www.jatl.org/blog/2016/4/20/ethical-issues-posed-by-time-based-billing-in-law-firms

23. Sidley Austin LLP. “2025 Fiscal Year in Review: SEC Enforcement Against Investment Advisers.” Sidley Austin, October 2025. https://www.sidley.com/en/insights/newsupdates/2025/10/2025-fiscal-year-in-review-sec-enforcement-against-investment-advisers

24. Royal Commission into Misconduct in the Banking, Superannuation and Financial Services Industry. Final Report (Hayne Royal Commission). Commonwealth of Australia, February 2019. https://www.royalcommission.gov.au/banking

The architecture has to be fiduciary.

25. Benthall, S., and Shekman, D. “Designing Fiduciary Artificial Intelligence.” NYU School of Law and Northwestern Pritzker School of Law, arXiv preprint 2308.02435, August 2023. https://arxiv.org/abs/2308.02435

26. Nay, J. “Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans.” Stanford Center for Legal Informatics (CodeX), arXiv preprint 2301.10095, January 2023. https://arxiv.org/abs/2301.10095

27. Riedl, M. O., and Desai, D. R. “AI Agents and the Law.” Proceedings of the 8th AAAI/ACM Conference on AI, Ethics, and Society, 2025. arXiv preprint 2508.08544. https://arxiv.org/abs/2508.08544

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned Intelligence

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

DEFINITION

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned Intelligence

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

DEFINITION

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

The next layer of AI will not only answer. It will Act.

"Who will it act for ?"

The question is:

Aligned

Intelligence

A THESIS FOR THE AGENTIC ERA

Aligned Intelligence

The next layer of AI will not only answer.

It will act.

The question is : Who will it act for ?

Peter Toumbourou

DEFINITION

A new class of infrastructure, organised in three layers; what the system knows about the participant, how it reasons on their behalf, and finally how its loyalty is enforced.

THE INVERSION

Aligned Intelligence

The next layer of AI will not only answer. It will act.

Participants at the Center of Gravity.

Intelligence alone is not sufficient.

Systems must also be aligned.

Aligned Intelligence

The next layer of AI will not only answer. It will act.

Participants at the Center of Gravity.

Intelligence alone is not sufficient.

Systems must also be aligned.

1. Powerful AI is the default

Capability is no longer a scarce resource

The advantage is no longer having access to a powerful model.

Powerful models are the default.

The advantage is building a system people can trust to act in their interests.

2. Why the Next Era is Alignment

A system that answers a question makes a recommendation.

A system that acts makes a decision.

The difference is everything.

When AI only spoke, the worst it could do was mislead.

When AI acts, the participant pays the price for the wrong decision.

This is already happening.

3. The Broken Assumption

That was true while AI was only a tool.

Many of us assumed that smarter AI would automatically lead to better outcomes.

It breaks when AI becomes an agent.

It doesn’t decide who should benefit.

When AI only spoke, the worst it could do was mislead.

When AI acts, the participant pays the price for the wrong decision.

This is already happening.

Intelligence can optimise.

4. Why Now

Four powerful forces, converging at once.

(i) Capability is shipping.

AI is no longer just giving advice on how to fix your fridge.

It is acting where loyalty, care and consequence matter.

(ii) The Rollout has Begun

(iii) Risk is visible.

Aligned
Intelligence

Aligned
Intelligence

Aligned
Intelligence

Many of us assumed that smarter AI
would automatically lead to better outcomes.

Four powerful forces,
converging at once.

Capability is shipping.
The law is catching up.
Risk is visible.

The opening is now.

7. When fiduciaries
become software.

The duty cannot sit beside it.
It has to be built into the chair itself.

The question to ask before letting any agentic system act on your behalf is not how smart it is,
or how accurate nor how fast.

Alignment cannot be a posture or a marketing claim.
It has to be enforceable inside the system.

The companies that define the next decade will not be those that build the most powerful AI.

They will be those that build AI that can be trusted with consequence.