Teaching AI Agents to Fear the Dark: Deception Engineering for Autonomous AI Systems

You approved the agent. You configured the allowlist. You assumed the boundary held. Then the agent went further than you expected and nothing in your stack saw it happen.

AI agents using the Model Context Protocol can automatically discover and invoke exposed tools, creating reconnaissance-stage blind spots before traditional controls trigger. This session presents MCP Threat Incubator: an agentic deception approach that uses honeytrap MCP tools and decoy credential artifacts to detect unauthorized and high-risk tool usage early across AI-native environments.

The session builds from problem to deployment with a live detection demo at the center. Real telemetry from live deployments revealed two distinct behavioral populations: automated scanners that enumerate fast and shallow, and AI-driven agents that enumerate deliberately, select specifically, and then escalate. The trap caught both. The agents didn't know the dark was watching back.

Key Takeaways:
Where MCP tool ecosystems create early reconnaissance blind spots
How auto-approved agentic workflows enable tool misuse before policy layers activate
How agentic deception honeytraps detect credential-seeking behavior
How to design believable, safe deception MCP tools
How two-stage deception signals improve detection confidence
How to deploy deception patterns across AI-native tool environments

Harshad Sadashiv Kadam

Indeed Inc, Senior Infrastructure Security Engineer

Austin, Texas, United States

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Teaching AI Agents to Fear the Dark: Deception Engineering for Autonomous AI Systems

Harshad Sadashiv Kadam

Links

Actions