What new security risks come with agentic AI systems?

Prompt injection that hijacks the agent's instructions, tool abuse where the agent is tricked into harmful actions, data exfiltration through model outputs, and over-broad permissions on the tools the agent can call. These are baseline risks, not edge cases. Defending against them is the Harden phase of Pooya Golchian's AIDLC method.

How does the security engineer's role change with AI agents?

It expands from securing static code to securing autonomous decision-making. The security engineer designs the agent's permission boundary, builds injection defenses, enforces PII redaction and audit logging, and decides when data must stay on private infrastructure. Accountability for what the agent can do rests with them.

How do you defend an AI agent against prompt injection?

Design the permission boundary so the agent can only ever take safe actions, add injection defenses on every input path, redact PII before data reaches the model, and log every decision. The goal is that even a successfully manipulated agent cannot do real damage, because the boundary, not the prompt, decides what is possible.

Can agentic systems run in regulated environments like PDPL or DIFC?

Yes, when hardened correctly. Pooya Golchian deploys the model on private infrastructure so data never leaves the network, with audit logs, redaction, and guardrails as deliverables. The Harden phase of AIDLC produces a residency-compliant deployment that lets an autonomous system operate under PDPL or DIFC requirements.

How the Security Engineer Role Changed Under Agentic AI Development (AIDLC 2026)

Agents added a new attack surface, a system that takes natural-language instructions and acts on its own, so security engineering expanded from securing code to securing autonomous decisions. Prompt injection, tool abuse, and data exfiltration are the new baseline, and accountability for the agent's permission boundary rests with the security engineer.

Security engineering had a known map. Validate inputs, manage secrets, patch dependencies, enforce least privilege, audit access, and assume any input could be hostile. The threats evolved, but the surface was understood.

Agents added a surface that did not exist before: a system that takes natural-language instructions, decides on its own which tools to call, and acts. Every one of those properties is an attack vector. An attacker who can influence the input can try to rewrite the agent's instructions. An agent with broad tool access is a broad blast radius waiting for the wrong prompt.

From securing code to securing decisions

In the AIDLC method, this work lives in the Harden phase, and it is squarely the security engineer's. The threats are specific. Prompt injection that smuggles instructions past the system prompt. Tool abuse where a manipulated agent takes a destructive action it was technically allowed to take. Data exfiltration where sensitive context leaks through an output. Over-broad permissions that turn a small compromise into a large one.

Defending against these is not a checklist bolted on at the end. It is permission boundaries designed so the agent can only ever do safe things, injection defenses on every input path, PII redaction before data reaches the model, and audit logs that record every decision the agent made. When residency, PDPL, or DIFC compliance demand it, the model itself runs on private infrastructure so the data never leaves the network.

Accountability did not move

Here is what did not change: when an agent does something harmful, the security engineer answers for it. Autonomy does not dilute accountability, it concentrates it. The person who designed the agent's permission boundary owns the consequences of that boundary being too wide.

That is why Harden is a named phase with real outputs, not an afterthought. Guardrails, redaction, audit logging, and a residency-compliant deployment are the deliverables, and they are what let an autonomous system run in a regulated environment at all.

If your team gave an agent tool access without designing the permission boundary and the injection defenses first, you have shipped an attack surface, not a feature.

The security engineers who win

They threat-model the agent's autonomy, not just its code. They make the permission boundary the tightest thing in the system. They treat audit logs and redaction as required, not optional. And they measure their work in actions the agent could never have been tricked into taking.

AI Engineering for B2B

Gave an agent tool access before designing its security boundary?

I join your engineering team and build the agent layer alongside you, covering architecture, MCP integration, evals, and production deployment. When the engagement ends, your team owns the system and keeps shipping.

12+ years shipping production systems

Senior engineer turned AI specialist. React, Next.js, AWS, agent orchestration.

Dubai-based, working with B2B teams worldwide

Direct collaboration across UAE, Europe, and US time zones.

AI agent teams that ship, not demos that stall

Discovery, role design, MCP integration, evals, and production deployment.

Book an embedded engagementOr email pooya@pooyagolchian.com to scope a project.

If you want an agentic system hardened for production and regulated environments, book a discovery call and we will scope the threat model.

The Security Engineer in the Agentic Era: New Attack Surfaces, Same Accountability

From securing code to securing decisions

Accountability did not move

The security engineers who win

Gave an agent tool access before designing its security boundary?

Stuck between an AI pilot and a system your team can run?

About Pooya Golchian

Newsletter

From securing code to securing decisions

Accountability did not move

The security engineers who win

Gave an agent tool access before designing its security boundary?

Stuck between an AI pilot and a system your team can run?

About Pooya Golchian

Get practical AI and engineering playbooks

Newsletter